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Abstract. — Most physical systems are modelled by an ordinary or a partial differential 
equation, like the n-body problem in celestial mechanics. In some cases, for example when 
studying the long term behaviour of the solar system or for complex systems, there exist 
elements which can influence the dynamics of the system which are not well modelled 
or even known. One way to take these problems into account consists of looking at 
the dynamics of the system on a larger class of objects, that are eventually stochastic. 
In this paper, we develop a theory for the stochastic embedding of ordinary differential 
equations. We apply this method to Lagrangian systems. In this particular case, we 
extend many results of classical mechanics namely, the least action principle, the Euler- 
Lagrange equations, and Noether's theorem. We also obtain a Hamiltonian formulation 
for our stochastic Lagrangian systems. Many applications are discussed at the end of the 
paper. 
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INTRODUCTION 



Ordinary as well as partial differential equations play a fundamental role in most parts 
of mathematical physics. The story begins with Newton's formulation of the law of attrac- 
tion and the corresponding equations which describe the motion of mechanical systems. 
Regardless the beauty and usefulness of these theories in the study of many important 
natural phenomena, one must keep in mind that they are based on experimental facts, 
and as a consequence are only an approximation of the real world. The basic example we 
have in mind is the motion of the planets in the solar system which is usually modelled by 
the famous n-body problem, i.e. n points of mass nii which are only submitted to their 
mutual gravitational attraction. If one looks at the behaviour of the solar system for finite 
time then this model is a very good one. But this is not true when one looks at the long 
term behaviour, which is for instance relevant when dealing with the so called chaotic 
behaviour of the solar system over billions years, or when trying to predict ice ages over 
a very large range of time. Indeed, the n-body problem is a conservative system (in fact 
a Lagrangian system) and many non-conservative effects, such as tidal forces between 
planets, will be of increasing importance along the computation. These non-conservative 
effects push the model outside the category of Lagrangian systems. You can go further 
by considering effects due to the changing in the oblateness of the sun. In this case, we 
do not even know how to model such kind of perturbations, and one is not sure of staying 
in the category of differential equations^ 1 ) . 



( 'Note that in the context of the solar system we have two different problems: first, if one uses only 
Newton's gravitational law, one must take into account the entire universe to model the behaviour of the 
planets. This by itself is a problem which can be studied by using the classical perturbation theory of 
ordinary differential equations. This is different if we want to speak of the "real" solar system for which 
we must consider effects that we ignore. In that case, even the validation of the law of gravitation as a 
real law of nature is not clear. I refer to [16] for more details on this point. 
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As a first step, this paper proposes tackling this problem by introducing a natural 
stochastic embedding procedure for ordinary or partial differential equations. This con- 
sists of looking for the behaviour of stochastic processes submitted to constraints induced 
by the underlying differential equation^ 2 ** . We point out that this strategy is different from 
the standard approach based on stochastic differential equations or stochastic dynamical 
systems, where one gives a meaning to ordinary differential equations perturbed by a 
small random term. In our work, no perturbations of the underlying equation are carried 
out. 

A point of view that bears some resemblance to ours is contained in V.I. Arnold's 
materialization of resonances ([6], p. 303-304), whose main underlying idea can be briefly 
explained as follows: the divergence of the Taylor expansion of the arctanx function 
at for | x |> 1 can be proved by computing the coefficients of this series. However, 
this does not explain the reason for this divergence behaviour. One can obtain a better 
understanding by extending the function to the complex plane and by looking at its 
singularities at ±i. The same idea can be applied in the context of dynamical systems. 
In this case, we look for the obstruction to linearization of a real systems in the complex 
plane. Arnold has conjectured that this is due to the accumulation of periodic orbits in the 
complex plane along the real axis. In our case, one can try to understand some properties 
of the trajectories of dynamical systems by using a suitable extension of its domain of 
definition. In our work, we give a precise sense to the concept of differential and partial 
differential equations in the class of stochastic processes. This procedure can be viewed 
as a first step toward the general "stochastic programme" as described by Mumford in [51]. 

Our embedding procedure is based on a simple idea: in order to write down differential 
or partial differential equations, one uses derivatives. An ordinary differential equation 
is nothing else but a differential operator of order one^ 3 ). In order to embed ordinary 
differential equations, one must first extend the notion of derivative so that it makes 
sense in the context of stochastic processes. By extension, we mean that our stochastic 
derivative reduces to the classical derivative for deterministic differentiable processes. 
Having this extension, one easily defines in a unique way, the stochastic analogue of a 
differential operator, and as a consequence, a natural embedding of an ordinary differential 



^'This strategy is part of a general programme called the embedding procedure in [15] and which can be 
used to embed ordinary differential equations not only on stochastic processes but on general functional 
spaces. A previous attempt was made in [13], [14] in the context of the non-differentiable embedding of 
ordinary differential equations. 
' 3 ^In this case, we can also speak of vector fields. 
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equation on stochastic processes. 

Of course, one can think that such a simple procedure will not produce anything new 
for the study of classical differential equations. This is not the case. The main problem 
that we study in this paper is the embedding of natural Lagrangian systems which are 
of particular interest for classical mechanics. In this context, we obtain some numerous 
surprising results, from the existence of a coherent least action principle with respect to 
the stochastic embedding procedure, to a derivation of a stochastic Noether theorem, 
and passing by a new derivation of the Schrodinger equation. All these points will be 
described with details in the following. 

Two companion papers ([18], [9]) give an application of this method to derive new 
results on the formation of planets in a protoplanetary nebulae, in particular a proof of 
the existence of a so called Titus-Bode law for the spacing of planets around a given star. 

The plane of the paper is as follow: 

In a first part, we develop our notion of a stochastic derivative and study in details all 
its properties. 

Chapter 1 gives a review of the stochastic calculus developed by Nelson [53]. In partic- 
ular, we discuss the classical definition of the backward and forward Nelson derivatives, 
denoted by D and Z)*, with respect to dynamical problems. We also define a class of 
stochastic process called good diffusion processes for which one can compute explicitly 
the Nelson derivatives. 

In Chapter 2 we define what we call an abstract extension of the classical derivative. Us- 
ing the Nelson derivatives, we define an extension of the ordinary derivative on stochastic 
processes, which we call the stochastic derivative. As pointed out previously, one imposes 
that the stochastic derivative reduces to the classical derivative on differentiable determin- 
istic processes. This constraint ensures that the stochastic analogue of a PDE contains the 
classical PDE. Of course such a gluing constraint is not sufficient to define a rigid notion 
of stochastic derivative. We study several natural constraints which allow us to obtain a 
unique extension of the classical derivative on stochastic processes as 



(0.1) 
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By extending this operator to complex valued stochastic processes, we are able to define 
the iterate of T>, i.e. T> 2 = T> o T> and so on. The main surprise is that the real part of T> 2 
correspond to the choice of Nelson for acceleration in his dynamical theory of Brownian 
motion. However, this result depends on the way we extend the stochastic derivative to 
complex valued stochastic processes. We discuss several alternative which covers well 
known variations on the Nelson acceleration. 

In Chapter 3 we study the product rule satisfied by the stochastic derivative which 
is a fundamental ingredient of our stochastic calculus of variation. We also introduce 
an important class of stochastic processes, called Nelson differentiable, which have the 
property to have a real valued stochastic derivative. These processes play a fundamental 
role in the stochastic calculus of variation as they define the natural space of variations 
for stochastic processes. 

The second part of this article deals specifically with the definition of a stochastic 
embedding procedure for ordinary differential equations. 

Chapter 4 associate to a differential operator of a given form acting on sufficiently 
regular functions a unique operator acting on stochastic processes and defined simply by 
replacing the classical derivative by the stochastic derivative. This is this procedure that 
we call the stochastic embedding procedure. Note that the form of this procedure acts 
on differential operators of a given form. Although the procedure is canonical for a given 
form of operator, it is not canonical for a given operator. 

The previous embedding is formal and does not take constraints which are of dynamical 
nature, like the reversibility of the underlying differential equation. As reversibility plays a 
central role in physics, especially in celestial mechanics which is one domain of application 
of our theory, we discuss this point in details. We introduce an embedding which respect 
the reversibility of the underlying equation. Doing this, we see that we must restrict 
attention to the real part of our operator, which is the unique one to possess this property 
in our setting. We then recover under dynamical and algebraic arguments studies dealing 
with particular choice of stochastic derivatives in order to derive quantum mechanics from 
classical mechanics under Nelson approach. 

The third part is mainly concerned with the application of the stochastic embedding 
to Lagrangian systems. 
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We consider autonomous^ Lagrangian systems L(x,v), (x,v) G U C R d x M. d , where 
U is an open set, which satisfy a number of conditions, one of it being that it must 
be holomorphic with respect to the second variable which represent the derivative of 
a given function. Such kind of Lagrangian functions are called admissible. Using the 
stochastic embedding procedure we can associate to the classical Euler-Lagrange equation 
a stochastic one which has the form 



— (X(t),VX(t))=V 
where X is a real valued stochastic process. 



— (X(t),VX(t)) 



(SEL) 



At this point, our manipulation is only formal and one can ask if this embedding is 
significant or not. We then remark that the Lagrangian function L keep sense on stochastic 
processes and can be considered as a functional. As a consequence, we can search for the 
existence of a least action principle which gives the stochastic Euler-Lagrange equation 
(SEL). The existence of such a stochastic least action principle is far from being trivial with 
respect to the embedding procedure. Indeed, it must follows from a stochastic calculus of 
variations which is not developed apart from this procedure. Our problem can then be 
formalize as the following diagram: 



L(x,dx/dt) — — — > EL 



(0.2) 



s 



s 



L(X,VX) SLAP ? ) (SEL), 



where LAP is the least action principle, S is the stochastic embedding procedure, (EL) 
is the classical Euler-Lagrange equation associated to L and SLAP the at this moment 
unknown stochastic least action principle. The existence of such a principle is called the 
coherence problem. 

Chapter 7 develop a stochastic calculus of variations for functionals of the form 

rb 



(0.3) E 



L{X{t),VX(t))dt 



where E denotes the classical expectation. Introducing the correct notion of extremals 
and variations we obtain two different stochastic analogue of the least action principle 
depending on the regularity class we choose for the admissible variations. The main 
point is that for variations in the class of Nelson differentiable process, the extremals 
of our functional coincide with the stochastic Euler-Lagrange equation obtained via the 
stochastic embedding procedure. This result is called the coherence lemma. In the 



^This restriction is due to technical difficulties. 
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reversible case, i.e. taking as a stochastic derivative only the real part of our operator, we 
obtain the same result but in this case one can consider general variations. 

In chapter 8 we provide a first study of what dynamical data remain from the classical 
dynamical system under the stochastic embedding procedure. We have focused on 
symmetries of the underlying equation and as a consequence on first integrals. We prove 
a stochastic analogue of the Noether theorem. This allows us to define a natural notion 
of first integral for stochastic differential equations. This part also put in evidence the 
need for a geometrical setting governing Lagrangian systems which is the analogue of 
symplectic manifolds. 

Chapter 9 deals with the stochastic Euler-Lagrange equation for natural Lagrangian 
systems, i.e. associated to Lagrangian functions of the form 

(0.4) L(x,v)=T(v)-U(x), 

where U is a smooth function and T is a quadratic form. In classical mechanics U 
is the potential energy and T the kinetic energy. The main result of this chapter is 
that by restricting our attention to good diffusion processes, and up to a a well chosen 
function ip, called the wave function, the stochastic Euler-Lagrange equation is equivalent 
to a non linear Schrodinger equation. Moreover, by specializing the class of stochastic 
processes, we obtain the classical Schrodinger equation. In that case, we can give a very 
interesting characterization of stochastic processes which are solution of the stochastic 
Euler-Lagrange equation. Indeed, the square of the modulus of ip is equal to the density 
of the associated stochastic process solution. 

In chapter 10, we define a natural notion of stochastic Hamiltonian system. This result 
can be seen as a first attempt to put in evidence the stochastic analogue of a symplectic 
structure. We define a stochastic momentum process and prove that, up to a suitable 
modification of the stochastic embedding procedure called the Hamiltonian stochastic em- 
bedding, and reflecting the fact that the "speed" of a given stochastic process is complex, 
we obtain a coherent picture with the classical formalism of Hamiltonian systems. This 
first result is called the Legendre coherence lemma as it deals with the coherence between 
the Hamiltonian stochastic embedding procedure and the Legendre transform. Secondly, 
we develop a Hamilton least action principle and we prove again a coherence lemma, i.e. 
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that the following diagram commutes 



Sh 



H(x(t),p(t))—^H(X(t),P(t)) 
Hamilton least action principle Stochastic Hamilton least action principle 

(HE) — — (SHE) 

where Sh denotes the Hamiltonian stochastic embedding procedure. 



The last chapter discuss many possible developments of our theory from the point of 
view of mathematics and applications. 



PART I 



THE STOCHASTIC DERIVATIVE 



CHAPTER 1 



ABOUT NELSON STOCHASTIC CALCULUS 



1.1. About measurement and experiments 

In this section, we explain what we think are the basis of all possible extensions of the 
classical derivative. The setting of our discussion is the following: 

We consider an experimental set-up which produces a dynamics. We assume that each 
dynamics is observed during a time which is fixed, for example [0, T], where T G For 
each experiment i, i G N, we denote by Xi(t) the dynamical variable which is observed 
for t G [0,T]. 

Assume that we want to describe the kinematic of such a dynamical variable. What is 
the strategy ? 

The usual idea is to model the dynamical behaviour of a variable by ordinary differential 
equations or partial differential equations. In order to do this, we must first try to have 
access to the speed of the variable. In order to compute a significant quantity we can 
follow at least two different strategies: 



- We do not have access to the variable -Xj(i), t G [0, T], but to a collection of mea- 
surements of this dynamical variable. Assume that we want to compute the speed at 
time t. We can only compute an approximation of it for a given resolution h greater 
than a given threshold ho- Assume that for each experiment we are able to compute 
the quantity 

nn „ rfl _ *»(* + ft) -*»(*) 

I 1 - 1 ,) Vi,h[t) Jl • 

We can then try to look for the behaviour of this quantity when h varies. If the 
underlying dynamics is not too irregular, then we can expect a limit for Vi^(t) when 



22 



CHAPTER 1. ABOUT NELSON STOCHASTIC CALCULUS 



h goes to zero that we denote by Vi(t). 



We then compute the mean value 



(1.2) 



v 



(*) = -E«*(*)- 



i=i 



If the underlying dynamics is not too irregular then v(t) can be used to model the 
problem. In the contrary the basic idea is to introduce a random variable. 

Remark that due to the intrinsic limitation for h we never have access to v^it) so 
that this procedure can not be implemented. 
- Another idea is to look directly for the quantity 



Contrary to the previous case, if there exists a well defined mean value Vh(t) when 
n goes to infinity then we can have a as close as we want approximation. Indeed it 
suffices to do sufficiently many experiences. We then look for the limit of Vh{t) when 
h goes to zero. 

For regular dynamics these two procedures lead to the same result as all these quantities 
are well defined and converge to the same quantity. This is not the case when we deal 
with highly irregular dynamics. In that case the second procedure is easily implemented 
contrary to the first one. The only problem is that we loose the geometrical meaning of 
the resulting limit quantity with respect to individual trajectories as one directly take a 
mean on all trajectories before taking the limit in h. 

This second alternative can be formalized using stochastic processes and leads to the 
Nelson backward and forward derivatives that we define in the next section. 

We have take the opportunity to discuss these notions because the previous remarks 
proves that one can not justify the form of the Nelson derivatives using a geometrical argu- 
ment like the non differentiability of trajectories for a Brownian motion. This is however 
the argument used by E. Nelson ([54], p. 1080) in order to justify the fact that we need a 
substitute for the classical derivative when studying Wiener processes. This misleadingly 
suggest that the forward and backward derivative capture this non differentiability in their 
definition, which is not the case. 



(1.3) 




i=i 
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1.2. The Nelson derivatives 



Let X(t), ^ t ^ 1 be ti-dimensional continuous random process denned on a prob- 
ability space (fl,A,P), where A is the cr-algebra of all measurable events and P is a 
probability measure defined on A. We denote by I the open interval (0, 1). 

Definition 1.1. — The random process X(t), a ^ t ^ b, is an SO-process if each X(t) 
belongs to L 1 ^) and the mapping t X(i) from K to is continuous. 

Let V = {Vt} and T = {Ft} be an increasing and a decreasing family of sub-cr-algebras, 
respectively, such that X(t) is ^-measurable and "P^-measurable. In other words, T and 
V are two filtration to which X(t) is adapted. We let E[» \ B\ denote the conditional 
expectation with respect to any sub-a-algebra B C A. 

Definition 1.2. — The random process X{t), a ^ t ^ b, is an Sl-process if it is an 
SO-process such that 

~X(t + h)-X(t) 



(1.4) 
and 
(1.5) 



DX(t) 



lim E 



h 



D*X(t) = lim E 



X(t) - X(t - h) 
h 



exist in L 1 (f2) and the mappings t h-> DX{t) and t ^ D*X(t) are both continuous from 
to L\Q). 



Definition 1.3. 

Sl-process, and 

(1.6) 

and 

(1.7) 

exist in L 1 (0). 



The random process X{t), a ^ t ^ b, is an S2-process if it is an 
\X(t + h)-X{t)f ^ 



a 2 X(t) 



^X(t) 



lim E 



lim E 



h 



{X{t + h)~ X{t)f 
h 



Definition 1.4- - - We denote by C 1 (7) the totality of S2-processes with continuous sam- 
ple paths, such that X(t), DX(t) and D*X(t), a ^ t ^ b, all lie in the Hilbert space L 2 {Q) 
and are continuous functions of t in L 2 {Q). 
A completion ofC l {I) in the norm 



(1.8) 



X ||= sup(|| X(t) \\ L 2 (n) + || DX(t) \\ L 2 (n) + || D*X(t) || L2(Q) ), 



tei 



is also denoted by C 1 (I), where \\ . denotes the norm of Hilbert space L 2 (Q). 
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Remark 1.1. — The main point in the previous definitions for a forward and backward 
derivative of a stochastic process, is that the forward and backward filtration are fixed by the 
problem. As a consequence, we have not an intrinsic quantity only related to the stochastic 
process. A possible alternative definition is the following: 

Definition 1.5. — Let X be a stochastic process, and a(X) (resp. a^(X)) the forward 
(resp. backward) adapted filtration. We define 

(1.9) rsX{t) = lim hT x E[X(t + h) - X(t) \ a{X s , < s < t)], 

h— +0+ 

(1.10) t>*X(t) = lim h^EiXit) - X(t - h) | a{X s ,t s: s < 1)]. 

In this case, we obtain intrinsic quantities, only related to the stochastic process. 
However, these new operators behave very badly from an algebraic view point. Indeed, 
without stringent assumptions on stochastic processes, we do not have linearity of b or . 

This difficulty is not apparent as long as one restrict attention to a single stochastic 
process. 



1.3. Good diffusion processes 

We introduce a special class of diffusion processes for which we can explicitly compute 
the derivative D, D*, DD*, D*D, D 2 and D 2 . 

Definition 1.6. - - We denote by the space of diffusion processes X satisfying the 
following conditions: 

i- X solves a stochastic differential equation : 

(1.11) dX(t) = b(t, X(t))dt + a(t, X(t))dW(t), X(0) = X , 

where X G L 2 (Vl), b : [0, T]xR d ^ R d and a : [0, T]xR d ^ l d ®l d are Borel measurable 
functions satisfying the hypothesis : there exists a constant K such that for every x, y 6 M. d 
we have 

(1.12) sup (\a(t, x) - a(t, y)\ + \b(t, x) - b(t, y)\) ^ K \x - y\ , 

t 

(1.13) sup(\a(t,x)\ + \b(t,x)\) ^K(l + \x\). 

t 

ii- For any t > 0, X(t) has a density pt(x) at point x. 



1.4. THE NELSON DERIVATIVES FOR GOOD DIFFUSION PROCESSES 
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Hi- Setting aij = (aa*)ij, for any i € {1, ■ • • , n}, for any to > 0, for any bounded open set 
D c R d , 

(1.14) I [ \dj(aij(t,x)p t (x))\dxdt < +oo. 

./to •/ D 



1 

Pt(z)' 



ro- 6 and (t,x) — ► - / ^ dj(aij(t,x)pt(x)) are continuous and bounded functions. 



Remark 1.2. - — Hypothesis Hi) ensures that (1.11) has a unique t— continuous so- 
lution X{t). 

— Hypothesis i), ii) and Hi) allow to apply theorem 2.3 p.217 in [49]. 

— We may wonder in which cases hypothesis ii) holds. Theorem 2.3.2 p.lll of [58] 
gives the existence of a density for all t > under the Hormander hypothesis which 
is involved by the stronger condition that the matrix diffusion era* is elliptic at any 
point x. A simple example is given by a SDE where b is a C°°(I x M. d ) function with 
all its derivatives bounded, and where the diffusion matrix is a constant equal to eld. 
In this case, pt{x) belongs to C°°(I x R. d ); moreover, if Xq has a differentiate and 
everywhere positive density po{x) with respect to Lebesgue measure such that po(x) 
and po(x)~ 1 Vpo(x) are bounded, then b(t,x) — cVlog(pt(x)) is bounded as noticed in 
the proof of proposition 4.1 in [64]. So hypothesis ii) seems not to be such a restrictive 
condition. 

— Assumption iv) is necessary to compute explicitly the second order operators of D 
and D* . The existence of D and is ensured under a weaker condition, the finite 
entropy condition equivalent to 



(1.15) E 



[\b(t, 
Jo 



X{t)fdt 



< oo. 



We refer to Follmer ([25], proposition 2.5 p. 121 and lemma 3.1 p. 123) for more 
details. 

According to the theorem 2.3 of [49] and thanks to iv), we will see that C C 1 ([0, T]) 
and that we can compute DX and D*X for X G (see Theorem 1.1). 



1.4. The Nelson derivatives for good diffusion processes 

A useful property of good diffusions processes is that their Nelson's derivatives can be 
explicitly computed. Precisely, we have: 
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Theorem 1.1. — Let X G A d which writes dX(t) = b(t, X(t))dt+a(t, X(t))dW(t). Then 
X is Markov diffusion with respect to an increasing filtration (Vt) and a decreasing filtra- 
tion (J-'t)- Moreover, DX and D^X exists w.r.t. these filtration and : 

(1.16) DX(t) = b(t,X(t)) 

(1.17) D*X(t) = K(t,X(t)) 

where x — > pt(x) denotes the density of X(t) at x and 

b\(t,x) = b%x) - ■^~-d j (a i \t,x)p t (x)) 
with the convention that the term involving ^|^y is if Pt(x) = 0. 

Proof. — The proof uses essentially theorem 2.3 of Millet-Nualart-Sanz [49] and the 
techniques of M. Thieullen for the proof of proposition 4.1 in [64]. 



(1) Let X £ Ad. Then X is a Markov diffusion w.r.t. the increasing filtration (Vt) 
generated by the Brownian Motion Wit) and so : 



E 



X(t + h)- X(t) 



and 



E 



E 



X(t + h)-X(t) 
h 



h 



Wt 







= E 


hj t 





"t+h 



b(s,X(s))ds\V t 



b(t,X(t)) 



< E 



h 



t+h 



\b(s,X(s))-b(t,X(t))\ds 



We can apply the dominated convergence theorem since b is bounded and 

rt+h 



i rt+ri 

- / |6(s, X(s)) - b(t, X(t))\ ds^O a.s. 
h Jt 



(for b is continuous and X has a.s. continuous paths). 



Therefore DX exists and DX(t) = b(t,X(t)). 



(2) As I £ Aa, we can apply theorem 2.3 in [49]. So X(t) = X(l — t) is a diffusion 
process w.r.t. an increasing filtration (Vt) and whose generator reads L t f = Vdif + 
\a} j dijf with 5*^(1 -t,x) = a»(t,x) and 6*(1 -t,x) = -b\t,x) + — —dj(a ij (t, x)p t (x)). 
Setting Tt = Vi-t, X is a Markov diffusion w.r.t. the decreasing filtration (J-'t)- We have 



E 



X(t) - X(t - h) 
h 



(1.18) 



= E 
= -E 



X(l-t)-X(l-t + h) . 
i ' i-t 



l-t+h 



h Ji- 



b(s,X(s))ds\Vi- t 
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Using the same calculations and arguments as above (since hypothesis iv) in the definition 
of class Arf implies that b is continuous and bounded), we obtain that D*X(t) exists and 
is equal to -6(1 -t,X(l -t)). □ 

In the case of fractional Brownian motion of order H / 1/2, the Nelson derivatives do 
not exist. However, one can define new operators using the so-called quasi conditional 
expectation introduced by [1]. We refer to the work of Darses and Sausserau [19] for more 
details. 

1.5. A remark about reversed processes 

This part reviews basic results about reversed processes, with a special emphasis to 
diffusion processes. We use Nelson's stochastic calculus. 

Let X be a process in the class C 1 ([0, 1]). We denote by X the reversed process : 
X(t) = X(l — t), with his "past" Vt and his "future" Tt- As a consequence, we also have 
x e ^([0,1] -► H). 

Using the operators a and a* defined in definition 1.5,we have: 
Lemma 1.1. — b*x(t) = — bx(l — t) = —bx(t). 
Proof. — The definition of b* gives immediately: 

T t ■ 

But T t = a{x(s),t ^ s ^ 1} = a{x(u),0 ^ u ^ 1 - t} = V\- t - 
Thus: 

Kx(t) = lim — E 

6^0+ 

□ 

The same computation is not at all possible when dealing with the operators D and 



lim E 



x(l-t) -x(l-t + e) 



x(l-t + e) -x(l-t) 



= -dx(l — t) = -t>x(t). 
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STOCHASTIC DERIVATIVE 



In this part, we construct a natural extension^ of the classical derivative on real 
stochastic processes as a unique solution to an algebraic problem. This stochastic deriva- 
tive turns out to be necessarily complex valued. Our construction relies on Nelson's 
stochastic calculus [53]. We then study properties of our stochastic derivative and es- 
tablish a number of technical results, including a generalization of Nelson's product rule 
[53] as well as the stochastic derivative for functions of diffusion processes . We also com- 
pute the stochastic derivative in some classical examples. The main point is that, after 
a natural extension to complex processes, the real part of the second derivative of a real 
stochastic process coincide with Nelson's mean acceleration. We define a special class of 
processes called Nelson differentiable, which will be of importance for the stochastic cal- 
culus of variations developed in chapter 7. This part is self contained and all basic results 
about Nelson's stochastic calculus are reminded. 

2.1. The abstract extension problem 

In this section, we discuss in a general abstract setting, what kind of analogue of the 
classical derivative we are waiting for on stochastic processes. 

We first remark that rea^ 2 ) valued functions naturally embed in stochastic processes. 

Indeed, let / : R — > R be a given function. We denote by Xf the deterministic stochastic 
process defined by 

(2.1) X f (u) = f G Q. 

W A precise meaning to this word will be given in the following. It should be noted that Malliavin calculus 
is not an extension of the ordinary differential calculus (see below). 

^Our aim was first to study dynamical systems over I™. However, as we will see we will need to consider 
complex valued objects. 
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We denote by i : R R — > V the map associating to / € R M the stochastic process Xj. 

We denote by Vdet the subset of V consisting of deterministic processes, and by V^ ct 
the set i{C k ), k > 1. 

As a consequence, we have a natural action of the classical derivative on the set of 
differentiable deterministic processes, that we denote again d/dt. 

Let K = R or C. In the sequel, we denote by Vk C 5ft; a subset of the set of i\~-valued 
stochastic processes®. 

Let K = R or C. 

Definition 2.1. — Let K = R or C. An extension of d/dt on Vk is an operator 5, i.e. 
a map 5 : Vk — ► $k such that: 

i) 5 coincides with d/dt on V\ et , 

ii) 5 is R-linear. 

Condition i), which is a gluing condition on the classical derivative is necessary as 
long as one wants to relate classical differential equations with their stochastic counterpart. 

Condition ii) is more delicate. Of course, one has linearity of S on Diff. A natural idea is 
then to preserve fundamental algebraic properties of d/dt, R-linearity being one of them. 
This condition is not so stringent, if for example we consider K = C. But, following this 
point of view, one can ask for more precise properties like the Leibniz rule 

(2.2) d/dt(X-Y) = d/dt(X) ■ Y + X ■ d/dt(Y), VX,Y eV\ ct . 

In what follows, we construct a stochastic differential calculus based on Nelson's deriva- 
tives. 

2.2. Stochastic differential calculus 

In this part, we extend the classical differential calculus to stochastic processes using a 
previous work of Nelson [53] on the dynamical theory of Brownian motion. We define a 
stochastic derivative and review its properties. 



( 'We do not give more precisions on this set for the moment, the set V can be the whole set of real or 
complex valued stochastic processes, or a particular class like diffusion processes,... etc. 
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2.2.1. Reconstruction problem and extension. — Let us begin with some heuristic 
remarks supporting our definition and construction of a stochastic derivative. 

Our aim is to construct a "natural" operator on S 1 (I) which reduces to the classical 
derivative d/dt over differentiable deterministic processes^ 4 ) . The basic idea underlying 
the whole construction is that, for example in the case of the Brownian motion, the 
trajectories are non-differentiable. At least, this is the reason why Nelson [53] intro- 
duces the left and right derivatives DX and D*X for a given process X. If we refer to 
geometry, forgetting for a moment processes for trajectories, the fundamental property 
of the classical derivative dx/dt(to) of a trajectory x(t) at point to, is to provide a first 
order (geometric) approximation of the curve in a neighbourhood of to. One wants to 
construct an operator, that we denote by V, such that the data of VX(to) allows us to 
give an approximation of X in a neighbourhood of to- The difference is that we must 
know two quantities, namely DX and D*X, in order to obtain the information^ . For 
computational reasons, one wants an operator with values in a field F. This field must 
be a natural extension of R (as we want to recover the classical derivative) and at least of 
dimension 2. The natural candidate to such a field is C. One can also recover C by saying 
that we must consider not only R but the doubling algebra which corresponds to C. 

This informal discussion leads us to build a complex valued operator T> : C 1 (/) — ► C^. (/) , 
with the following constraints: 

i) (Gluing property) For X £ V\ eV VX{t) = dX/dt, 

ii) The operator V is R-linear, 

iii) (Reconstruction property) For X £ C 1 (I), let us denote by 

VX = A(DX, D*X) + iB(DX, D*X), 
where A and B are linear R- valued mappings by ii). We assume that the mapping 
(DX,D*X) ^ (A(DX,D*X),B(DX,D*X)) 

is invertible. 



1 'A rigourous meaning to this sentence will be given in the sequel. 

^This remark is only valid for general stochastic processes. Indeed, as we will see, for diffusion processes, 
there is a close connection between DX and D*A, which allows to simplify the definition of V. 
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Lemma 2.1. — The operator V has the form 

V^X = [aDX + (1 - a)D*X] + ifib [DX - D*X] , p = ±1, 
where a, 6 £ R and 6/0. 

Proof. — We denote by = aDX + bD,X and = cDX + dD*X. If X € C^I), 

we have = = dX/dt, and i) implies 

a + 6= 1, c + d = 0. 

We then obtain the desired form. By iii) , we must have b / in order to have invertibility. 

□ 

In order to rigidify this operator, we impose a constraint coming from the analogy with 
the construction of the scale-derivative for non-differentiable functions in [13]. 

iv) If A, = —D, then A(X) = 0, B(X) = D. 

We then obtain the following result: 

Lemma 2.2. — An operator V satisfying conditions i), ii), Hi) and iv) is of the form 

, s D + D* D-D* 

(2-3) = + *M— A* = ±1- 

Proof. — Using lemma 2.1, iii) implies the relations: 2a — 1 = and 26 = 1, so a = b = 

1/2. □ 

We then introduce the following notion of stochastic derivative: 

Definition 2.2. - We denote byV^ the operators defined by 

D + D* D-D* 
= + ^ , n = ±1. 

2.2.2. Extension to complex processes. — In order to embed second order differ- 
ential equations, we need to define the meaning of V 2 , and more generally of V n , n G N. 
The basic problem is that, contrary to what happens for the ordinary differential operator 
d/dt, even if we consider real valued processes X, the derivative VX is a complex one. 
As a consequence, one must extend V to complex processes. 

For the moment, let us denoted by T>£ the extension to be define of V, to complex 
processes. Let F be a field containing C to be defined, and T>c : CcOO F- There 
are essentially two possibilities to extend the stochastic derivative leading to the same 
definition: an algebraic and an analytic one. 
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2.2.2.1. Algebraic extension. — Let us assume that: 

i) the operator T>£ is R-linear. 

Let Z = X + iY be a complex process, where X and Y are two real processes. By 
R-linearity, we have 

V c (Z)=V c X + V c (iY). 

As T>£ reduce to V on real processes, we obtain 

V c (Z)=VX + V c (iY), 

which reduce the problem of the extension to find a suitable definition of V on purely 
imaginary processes. 

We now make an assumption about the image of V^: 

ii) The operator T>£ is C-valued. 

This assumption is far from being trivial, and has many consequences. One of them is 
that, whatever the definition of T>£(iY) is, we will obtain a complex quantity which mixes 
with the quantity T>X in a non trivial way. 

Remark 2.1. — One can wonder if another choice is possible, as for example, using 
quaternions in order to avoid this mixing problem. However, a heuristic idea behind the 
complex nature ofT> is that it corresponds to a fundamental property of Nelson processes, 
the (in general) non- differ entiable character of trajectories. Then, the doubling of the 
underlying algebra is related to a symmetry breaking^ . The computation of T> 2 is not 
related to such phenomenon. 

In the following, we give two different extensions of V to complex processes under 
hypothesis i) and ii). The basic problem is the following: 

Let Y be a real process. We denote 
(2.4) VY = S{Y) ± iA(Y), 

where 



(2.5) S(Y) 



D + D, 



(Y), and A(Y) 



D-D* 



(Y), 



^This reduces to DX = D*X for deterministic different iable processes, namely the invariance under 
h -> -h. 
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and the letters S and A stand for the symmetric and antisymmetric operators with 
respect to the exchange of D with D*. 

We denote 

(2.6) V c (iY) = R(Y)+iI(Y), 
where R(Y) and I(Y) are two real processes. 

One can ask if we expect for special relations between R(Y), I(Y) and S(Y), A(Y). 

2.2.2.1.1. C-linearity. - - If no relations are expected for, the natural hypothesis is to 
assume C-linearity of T>£, i.e. 

(2.7) V c (jY) = WY 

As a consequence, we obtain the following definition for the operator T>£\ 

We denote by 6^(7) the set of stochastic processes of the form Z = X + iY, with 

X,Y € e\i). 

Definition 2.3. — The operator T>£ : — > is defined by 

V C JX + iY) = V^X + ifiV^Y, M = ±1, 

where X,Y £ Q 1 . 

In the sequel, we denote T>£ for T>£ a . 

The following lemma gives a strong reason to choose such a definition of T>£ . We denote 

by 

=2Vo-..o2V. 



Lemma 2.3. - We have 



(2.8) 2^ = 



-D-D* + D*D 



+ i 



D - D: 



2 

Proof. — One use the C-linearity of operator V. □ 

We note that the real part of V 2 is the mean acceleration as defined by Nelson [53] . 

Remark 2.2. — In ([53], p. 81-82), Nelson discusses natural candidates for the stochastic 
analogue of acceleration. More or less, the idea is to consider quadratic combinations of 
D and D* , respecting a gluing property with the classical derivative: 
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Let Q a ,b,c,d( x i y) = ax2 + bxy + cyx + dy 2 be a real non- commutative quadratic form 
such that a + b + c + d = 1. A possible definition for a stochastic acceleration is Q(D,D*). 

We remark that the condition a + b + c + d= 1 implies that when D = D* , we have 
Q(D,D*) = D = D*. 

The simplest examples of this kind are: D 2 , D 2 , DD* and D*D. 

We can also impose a symmetry condition in order to take into account that we do not 
want to give a special importance to the mean-forward or mean-backward derivative, by 
assuming that Q(x,y) = Q(y,x), so that Q is of the form 

Q a (x,y) = a(x 2 +y 2 ) + (1 - 2a)^,aG R. 

The simplest example in this case is obtained by taking a = 0, i.e. 

DD* + D*D 
Q (D,D*) = *^ 

This last one corresponds to Nelson's mean acceleration and coincide with the real part of 
our stochastic derivative. 

It must be pointed out that Nelson discuss only five possible candidates where at least 
a three parameters family can be defined by Q a ,b,c,i-a-b-c(D, D*). His five candidates 
correspond to the simplest cases we have described. 

The choice of Qo(D, D*) as a mean acceleration is justified by Nelson using a Gaussian 
Markov process X(t) in equilibrium, satisfying the stochastic differential equation 

dX(t) = -coX(t)dt + dW{t). 

We will return to this problem below. 

2.2.2.2. Analytic extension. — We first remark that D and D* possess a natural extension 
to complex processes. Indeed, let X = X 1 + iX 2 , with X^ £ S 1 (I) then 

D(X X + iX 2 ) = D(X ± ) + iD(X 2 ) and D.(Xi + iX 2 ) = + iD*(X 2 ). 

As a consequence, the quantities S(Y) and A(Y) introduced in the previous section for 
real valued processes make sense for complex processes, and the quantity A{X) + iS(X) 
is well defined for the complex process X G 6^(7). As a consequence, we can naturally 
extend V{X) to complex processes by simply posing 

r>nr\ D + D * i D ~ D * 
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with the natural extension of D and D*. 

2.2.2.3. Symmetry. - - A possible way to extend V is to assume that the regular part 
of T>£(iY) is equal the imaginary part of T>(Y), i.e. that the geometric meaning of the 
complex and real part of VY is exchanged. We then impose the following relation: 

R{Y) = aA(Y). 

This leads to the following extension: 

Definition 2-4- — The operator T>£ : — > is defined by 

V C JX + iY) = V„X - i,xV^Y, fi = ±1, 

where IT 6 6 1 . 



2.2.3. Stochastic derivative for functions of diffusion process. — In the following, 
we need to compute the stochastic derivative of f(t,X t ) where X t is a diffusion process 
and / is a smooth function. Our main result is the following lemma: 

Lemma 2-4- — Let X e Ad and f G C l,2 (I x R d ) such that d t f, Vf and dijf are 
bounded. Then, we have: 



(2.9) 
(2.10) 



Df(t,X(t)) 
DJ(t,X(t)) 



d t f + DX(t) ■ Vf + -/'ih,! 
d t f + D+X{t) • Vf - ^duf 



(t,X(t)), 
(t,X(t)). 



Proof. — Let X £ A d and / G C 1 ' 2 ^ x R d ) such that d t f, Vf and %/ are bounded. 
Thus / belongs to the domain of the generators L t and L t of the diffusions X(t) and X(t). 
Moreover these regularity assumptions allow us to use the same arguments as in the proof 
of theorem 1.1 in order to write : 

Df(t,X(t)) = d t f(t,X(t))+L t (f(t,-))(X(t)) 



dtf + b l dj + -a* fyf 



(t,X(t)) 

(t,X(t)) 



and 



d t f + DX(t)-Vf + -a^d ij f 



DJ(t,X(t)) = dtf(t,X(t))-L^ t (f(t,-))(X(t)) 



d t f + D.X{t) ■ Vf - -a'tdijf (t, X{t)) 



□ 



We deduce immediately the following corollary 
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Corollary 2.1. — Let X G A d and f G C 1 ' 2 (J x W d ) such that d t f , V/ and dijf are 
bounded. Then, we have: 



(2.11) 
and 



dtf + V„X(t)-Vf + % ^ dijf 



(t,X(t)). 



Corollary 2.2. — Let X £ with a constant diffusion coefficient a and f € C 1, (Lx 
swc/i i/tai dtf, Vf and d^f are bounded. Then, we have: 



(2.12) 



V^f(t,X(t)) = 



dtf + V.X^-Vf + ^—Af 



(t,X(t)). 



2.2.4. Examples. — We compute the stochastic derivative in some famous examples, 
like the Ornstein-Uhlenbeck process and a Brownian mation in an external force. 

2.2.4-1- The Ornstein-Uhlenbeck process. — A good model of the Brownian motion of a 
particle with friction is provided by the Ornstein-Uhlenbeck equation: 

f91 o^ / X»{t) = -aX\t) + *Z(t) 

1 6) \ X(0) = X , X'(0) = V , 

where X(t) is the position of the particle at time, a is the friction coefficient, a is the 
diffusion coefficient, Xq and Vq are given Gaussian variables, £ is "white noise". The term 
—aX'(t) represents a frictional damping term. 



The stochastic differential equation satisfied by the velocity process V{t) :- 
given by: 

f dV{t) = -aV{t)dt + adW(t) 
I V(0) = V , 

We can explicitly compute VV and V 2 V: 



Y'{t) is 



(2.14) 



Lemma 2.5. — Let V(-) be a solution of 

f0 ,-, / dV(t) = -aV(t)dt + adW(t) 

[ b) 1 V(0) = V , 

2 

where Vq has a normal distribution with mean zero and variance 

Then V G C 2 (]0,+cx))) and: 

(2.16) VV{t) = -iaV{t) 

(2.17) V 2 V(t) = -a 2 V(t). 

Proof. — The solution is a Gaussian process explicitly given by: 



(2.18) 



Vt ^ 0, V(t) = V e~ at + a ( e 

Jo 



■a(t-s) 



dW(s) 
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Therefore, we can compute the expectation and the variance of the normal variable 
V(t): 

j E[V{t)] = E[V ]e- at 
( } { Var(V^)) = g + (Var(Vb)) - g) e 



-2at 



We notice, as in [30], that if Vo has a normal distribution with mean zero and variance 

of. 

2a' 



then X is a stationary gaussian process which distribution pt(x) at each time t reads 



(2.20) p t (a;) = -p-e-^. 
As a consequence, we have 

(2.21) Vt ^ 0,ln(p t (x)) = ln(^) - 
and 

(2.22) o^d x HPt( x )) = <J 2 -^ = -lax. 
Moreover, we have 

(2.23) DV(t) = -aV(t), 
and according to theorem 1.1, we obtain 

(2.24) D*V(t) = -aV(t) - a 2 d x ln( Pt (V(t))) = aV{t). 

Therefore T>V(t) = —iaV(t), and using the C— linearity of T>, we obtain V 2 V(t) = 
— a 2 V(t), which concludes the proof. □ 

2.2.4-2. Brownian particle submitted to an external force. — In some examples of random 
mechanics, one has to consider the stochastic differential system: 

' dX(t) = V(t)dt 

(2.25) I dV{t) = -aV(t)dt + K{X{t))dt + adW{t) 

k X(0) = X Q , V(0) = V , 

X and V may represent the position and the velocity of a particle of mass m being under 
the influence of an external force F = —VU where U is a potential. Set K = F/m. The 
"free" case K = is the above example. 

When K{x) = —uj 2 x (a linear restoring force), the system can also be seen as the ran- 
dom harmonic oscillator. In this case, it can be shown that if (Xq, Vq) has an appropriate 
gaussian distribution then (X(t),V(t)) is a stationary gaussian process in the same way 
as before. 



Let us come back to the general case. 
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First, we remark that X is Nelson-differentiable and we have DX(t) = D*X{t) = V(t). 
Moreover, Nelson claims in ([53], p. 83-84) that, when the particle is in equilibrium with a 
special stationary density, 

(2.26) DV(t) = -aV(t) + K(X(t)), 

(2.27) D*V{t) = aV(t) + K(X(t)). 
We can summarize these results with the computation of V : 



(2.28) 
(2.29) 



VX(t) = V(t), 
V 2 X(t) = K(X{t)) - iaV(t). 
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3.1. Product rules 

In chapter 7, we develop a stochastic calculus of variations. In many problems, we will 
need the analogue of the classical formula of integration by parts, based on the following 
identity, called the product or Leibniz rule 



where /, g are two given functions. 

Using a previous work of Nelson [53], we generalize this formula for our stochastic 
derivative. We begin by recalling the fundamental result of Nelson on a product rule 
formula for backward and forward derivatives: 

Theorem 3.1. — Let X, Y G C 1 (/) ; then we have: 



We refer to ([53],p.80-81) for a proof. 

Remark 3.1. — It must be pointed out that this formula mixes the backward and for- 
ward derivatives. As a consequence, even without our definition of the stochastic deriva- 
tive, which takes into account these two quantities, the previous product rule suggests the 
construction of an operator which mixes these two terms in a "symmetrical" way. 

We now take up the various consequences of this formula regarding our operator T>. A 
straightforward calculation gives: 

Lemma 3.1. — Let X, Y G C 1 (/), we then have: 



d ( t x d f , f d 9 



(P) 



(3.1) 



-E[X(t) • Y(t)} = E[DX(t) • Y{t) + X{t) ■ D*Y(t)} 



(3.2) 
(3.3) 



E[lm(VX(t)) • Y(t)} 




E[Re{VX(t)) • Y(t) + X(t) ■ Re(VY(t))] 
E[X(t) • lm(VY(t))} 
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Lemma 3.2. — Let X,Y G We write X = Xi + iX 2 and y = Y 1 + iy> w/iere 

Xj, Yi G e^i). Therefore : 

(3.4) £[p M x • y + x • p M y] = ^ 5 (x(t), y (0) + r(x(t), y(t)), 

w/iere 

(3.5) g(X,Y)=E[X-Y], 
and 

,, fi v = -2E\Yi ■ Im(P M X 2 )] - 2E[Y 2 ■ Im(D M Xi)] 
1 j +» (2£[yi • Im^Xi)] - 2£[y 2 • Im(P M X 2 )]) . 

Proof. — We have 

YV^X = yRe^Xi) - yiIm(D M X 2 ) 

(3.7) -^Im^Xi) - y 2 Re(P M X 2 ) 

+i (y 1 Im(P M X 1 ) + y 1 Re(P /1 X 2 ) + yRe^Xx) - y 2 Im(2^X 2 )) . 

In a symmetrical way, we obtain 

XV^Y = X 1 Re(P M y 1 ) - X 1 Im(P /i y 2 ) 

(3.8) -X 2 Im(P M y 1 ) - X 2 Re(P /i y 2 ) 

+t (X 1 Im(P /i y 1 ) + XxRe^y) + X 2 Re(^y) - X 2 Im{V^Y 2 )) . 

Forming the sum of these expressions and using lemma 3.1, we obtain (3.4). □ 

The next lemma will be of importance in chapter 7 for the derivation of the stochastic 
analogue of the Euler-Lagrange equations: 

Lemma 3.3. — Let X,Y G 6^(7). We write X = X 1 + iX 2 and Y = Y 1 + iY 2 where 
Xi,Yi G C 1 (7). Therefore, we have: 

(3.9) E[V^X • y + X • V.^Y] = j t g{X(t),Y{t)) 
where g(X, Y) = E[X X ■Y 1 -X 2 - Y 2 ] + iE[Y Y -X 2 + Y 2 - X x ] = E[X ■ Y] 

Proof. — We have 

yv^x = y^^xo - yi^(p^x 2 ) 

(3.10) -ya^xo - y 2 »(^^ 2 ) 

+i (Y&fPuXi) + y^(p M x 2 ) + Y^iv^) - y 2 Q(p M x 2 )) , 

and in a symmetrical way 



XP^y = {X 1 +iX 2 ){V ll Y 1 +W ll Y 2 ) 

mn = Xx^yo + XxQ^y,) 

1 j +x 2 o?(p /i y 1 ) - x 2 »(p /i y 2 ) 

+i (-x^cr^y) + x^^y,) + x 2 3f?(p /1 y 1 ) + x 2 3?(p^y 2 )) . 

We form the sum of these expressions and we use the lemma 3.1 to obtain (3.4). □ 
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3.1.1. A new algebraic structure. — A convenient way to write equation (3.9) is to 
use the following Hermitian product: 

For all X,Y £ V<£, we denote by * the product 

(3.12) X*Y = X-Y, 
where . denotes the usual scalar product. 

Formula (3.9) is then equivalent to: 

(3.13) VE[X*Y]=E[DX*Y + X*VY], 

where we have implicitly used the fact that V reduces to d/dt when this quantity has a 
sense. 



This new form leads us to the introduction of the following algebraic structure, which 
is, as far as we know, new. Let 6 be the canonical mapping 

(3-14) 5: ICVVC - 

K J X®Y i-> X*Y. 

We define for T> the quantity A(X>) = T> % 1 + 1 % P, which we will call the coproduct 
of V. Then, denoting by E the classical mapping which takes the expectation of a given 
stochastic process, we obtain the following diagram: 



(3.15) 



V C ®V C 
X®Y 
s 

K*Y 
E 

E[X * Y] 



A(D) 



V 



V C 0V C 
VX ® Y + X <g> VY 
S 

VX*Y+X*VY 
E 

E[VX*Y + X*VY] 



This structure is similar to the classical algebraic structure of Hopf algebra. The difference 
is that we perturb the classical relations by a linear mapping, here given by E. It will be 
interesting to study this kind of structure in full generality. 



3.2. Nelson differentiable processes 

3.2.1. Definition. — We define a special class of processes, called Nelson-differentiable 
processes, which will play an important role in the stochastic calculus of variations of 
chapter 7. 
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Definition 3.1. — A process X G C 1 (/) is called Nelson differentiable if DX = D*X. 

Notation 3.1. - - We denote by AT 1 (I) the set of Nelson differentiable processes. 

A better definition is perhaps to use V instead of D and saying that Neison 
differentiable processes have a real stochastic derivative. 

The main idea behind this definition is that we want to define a class V of processes in 
&{T) such that if X G &■{!) then for all Y G V, we have 

lm(V(X + Y)) = lm(VX). 

This condition imposes that Im(PY) = 0. 

This condition will appear more clearly in chapter 7 concerning the stochastic calculus 
of variations. 

Remark 3.2. - - We must keep in mind that our definition of the stochastic derivative 
follows the idea of the scale calculus developed in [13] to study non- differentiable functions. 
In that context, the existence of an imaginary part for the scale derivative of a function 
is seen as a resurgence of its non- differentiability. In particular, when the underlying 
function is differentiable then the scale derivative is real. That is why we have chosen to 
call processes such that D = D* Nelson differentiable. 

The definition of Nelson differentiable processes is only given for processes in S 1 (7). It 
is not at all clear to know what is the correct extension to G^(I). As we have no use of 
such kind of notion on 6^(7) we don't discuss this point here. 

Of course a difficult problem is to characterize these processes. The next section dis- 
cusses some examples. 

3.2.2. Examples of Nelson-difFerentiable process. — We give examples of Nelson- 
differentiable processes. 

3.2.2.1. Differentiable deterministic process. — It is probably the first and the simplest 
example. Let x(-) be a differentiable deterministic process defined on I x fl The past V 
and the future T are trivial: 

Vt€ J, V t = T t = {0,0}. 

As a consequence, we have 

Vt G I, Dx(t) = D*x{t) = x'(t), 
where x' is the usual derivative of x. 
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3.2.2.2. A very special random example. — Let X G C 1 (/). In [53], Nelson shows that 
X is a constant (i.e. X(t) is the same random variable for all t) if and only if : Vt G 
/, DX(t) = D*X(t) = 0. So it provides us a random example of AA 1 (/)— process. 

3.2.2.3. Nelson- differentiable diffusion processes. - Using theorem 1.1, we can find a 
sufficient and necessary condition for a diffusion process to be a Nelson-differentiable 
process: 

Lemma 3-4- — Let X G A<f with a = const, then X G AA 1 (7) if and only if 

(3.16) V(a 2 p)(t,X(t)) = 0. 

When the diffusion equation is time homogeneous and the solutions have a density, we 
note that this density must be a stationary density. Moreover, the Fokker-Planck equation 
(Kolmogorov forward equation) allows us to give a necessary condition (a relation between 
the drift and the diffusion coefficient) for a diffusion equation to give a Nelson-differentiable 
solution. 

3.2.2.4- The random harmonic oscillator. — The random harmonic oscillator satisfies the 
stochastic differential equation: 

' dX(t) = V(t)dt 

(3.17) I dV(t) = -aV(t)dt - u 2 X{t)dt + adW(t) 

k X(0) = X , V(0) = V , 

As a consequence, we have X(t) = / V(s)ds with E / ^(s)] 2 ds < 00 (6 > 0), and 

Jo Uo 

X has a strong derivative in L 2 . We then obtain DX(t) = D*X(t) = V(t). Finally, we 
have X G ^([0,6]) and VX(t) = V(t). 

3.2.3. Product rule and Nelson-differentiable processes. — 
Corollary 3.1. — Let X, Y G 6^(7). If X is Nelson-differentiable then : 

(3.18) E[V^X(t) ■ Yit) + X(t) ■ V^Yit)] = ^E(X(t), Y(t)) 

Proof. — This is a simple consequence of the fact that if X = X\ + iX2 is Nelson- 
differentiable then Im(£> M A"i) = Im^A^) =0. □ 



PART II 



STOCHASTIC EMBEDDING 
PROCEDURES 



CHAPTER 4 



STOCHASTIC EMBEDDING OF DIFFERENTIAL 

OPERATORS 



A natural question concerning ordinary and partial differential equations concerns their 
behaviour under small random perturbations. This problem is particularly important in 
natural phenomena where we know that models are only an approximation of the real 
setting. For example, the study of the long term behaviour of the solar system is usually 
done by running numerical computations on the n-body problem. However, many effects 
in the solar systems are not included in this model and can be of importance if one looks 
for a long term integration, as non conservative effects (due to tidal forces between planets) 
and the oblatness of the sun which is not yet modelled by a differential equation. 

The main problem is then to find the correct analogue of a given differential equation 
taking into account the following facts: 

i) The classical equation is a good model at least in first approximation, 

ii) One must extend this equation to stochastic processes. 

Using the stochastic derivative introduced in the previous part, we give a natural em- 
bedding of partial or ordinary differential equations into stochastic partial or ordinary 
differential equations. It must be pointed out that we do not perturb the classical equa- 
tion by a random noise or anything else. In this respect we are far from the usual way of 
thinking underlying the fields of stochastic differential equations or stochastic dynamical 
systems. 

Of course, having this natural embedding, we can naturally define what a stochastic 
perturbation of a differential equation is. This is simply a stochastic perturbation of the 
stochastic embedding of the given equation. The main point is that we stay in the same 
class of objects dealing with perturbations, which is not the case in the stochastic theory 
of differential equations, where we jump from classical solutions to stochastic processes in 
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one step using for example Ito's stochastic calculus^ 1 ) . 

In this part we first give a general embedding procedure for partial differential equations. 
We discuss classical examples, in particular first and second order differential equations. 
The case of Lagrangian systems is studied in details in chapter 7. An important part 
of classical differential equations coming from mechanics are reversible. This property 
is not conserved by the previous stochastic embedding procedure. We define a special 
embedding called reversible, which preserves this property, meaning that if X is a solution 
of the stochastic embedded equation, then X, the reversed process, is again a solution. 



4.1. Stochastic embedding of differential operators 

In this part, we first give an abstract embedding procedure based on an extension of 
the classical derivative defined in the previous part. We then specialize our embedding 
procedure using the stochastic derivative. 

4.1.1. Abstract embedding. — Let A be a ring, we denote by A[x] the ring of poly- 
nomials with coefficients in A. Let A = C 1 (R d x R). 

Definition J^.l. — A differential operator is an elements of A[d/dt] . 

Let O G A[d/dt], the differential operator O is of the form 

d d n 

(4.1) O = a (; t) +a 1 (;t)—-\ h a„(», t)-j^, a« G A, = 0, . . . , n, 

for a given n G N, called the degree of O. 

The action of O on a given function x : R — > R d , 1 i— ► x(t) is denoted O ■ x and defined 

by 

(4.2) 0-x = J2«t),t)^. 

Definition 4-2 (Abstract stochastization). — Let O G A[d/dt] be a differential op- 
erator, of the form 

d d n 

(4.3) O = ao{;t) + ai (;t)— + --- + a n (;t) — , a { G A, =0,...,n, 

where n € N is given. 



^This remark is also valid for all the theories of this kind, using your favourite stochastic calculus, like 
Malliavin calculus for example. 
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The stochastic embedding of O with respect to the extension 5 : V — > V is an element 
O5 ofP[5] defined by 

(4.4) 5 = a (; t) + ai(», t)S + --- + a n {; t)5 n , ai S V, i = 0,...,n, 
where 5 n = 5 o • • • o 5. 

The action of 0$ on a given stochastic process X, denoted by 0$ • X is defined by 

n 

(4.5) O s -X = Y j a l (X,t)5 i X, 

i=0 

where the notation ai(X,t) stands for the stochastic process defined for all uj G O by 

(4.6) cn(X,y)(u) = ai (X(u,t),t). 

The main property of this embedding is the fact that 

(4.7) O s \v Sct =0, 

so that the classical differential equation associated to O, and given by 

O ■ x = 0, (E) 
is contained in the stochastic differential equation 

O 5 -X = 0. (SE). 

4.1.2. Nelson Stochastic embedding. — Using the stochastic derivative, we have a 
particular stochastic embedding procedure. 

Definition 4-3 (Stochastization). — Let O G A[d/dt] be a differential operator, of the 
form 

d d n 

(4.8) O = a (;t) + a 1 (;t)— + --- + a n (;t) — , ai £ A, =0,...,n, 

where n G N is given. 

The stochastic embedding of O with respect to the stochastic extension is an element 
Ostoc o/C 1 (7)[P (T ] defined by 

(4.9) O s toc = ao(; t) + oi(«, t)V + ■ ■ ■ + a n (; t)V n , a { £ C\l), i = 0, . . . , n. 

We denote by S the operator associating to an operator O of the form 4.8 the operator 
O stoc . As a consequence, we will frequently use the notation S(O) for O stoc - 
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In some occasions, in particular for the Euler-Lagrange equation, we will need to con- 
sider differential operators in a non-standard form. Precisely, we need to consider operators 
like 

(4.10) B a = j t oa(;t). 
This notation means that B a acts on a given function as 

(4.11) B a -x=j t (a(x(t),t))). 

The basic idea is to define the stochastic embedding of B a as follow: 

Definition 4-4- — The stochastic embedding of the basic brick B a is given by 

(4.12) B a = Voa{.,t). 

However, classical properties of the differential calculus allow us to write B a equivalently 

as 

(4.13) Ba .x = a'(x)^. 

The stochastic embedding of this new form of B a is given by 

(4.14) M a .X = a'{X)VX. 
The main problem is that in general, we do not have 

(4.15) B a = B , 
as in the classical case. 

This reflects the fact that 8 acts on operators of a given form and not on operators as 
an abstract element of a given algebra. In particular, this is not a mapping. 

Nevertheless, there exists a class of functions a such that equation (4.15) is valid: 

Lemma 4-1- — Equation (4-15) is satisfied on the set with constant diffusion if a is 
an harmonic function. 

Proof — This follows easily from corollary 2.2. □ 

In the sequel we study some basic properties of this embedding procedure on differential 
equations. 
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4.2. First examples 

4.2.1. First order differential equations. — Let us consider a first order differential 
equation 

(IT 

- = /(x,t), I -(ODE) 

where x 6 R and /:RxR^IRisa given function. The stochastic embedding of (1-ODE) 
leads to 

M = F(X,f), l-(SODE) 

where F is real valued. 

The reality of F imposes important constraints on solutions of l-(SODE). Indeed, we 
must have 

DX = D*X, 

so that X belongs to the class of Nelson-differentiable processes. 

In our general philosophy, ordinary differential equations are only coarse approxima- 
tions to reality which must include stochastic behaviour in its foundation. A stochastic 
perturbation of a first order differential equation is then highly non-trivial. Indeed, we 
must consider SODE's of the form 

VX = F{X,t) + eG(X,t), 

where G(X, t) is now complex valued. As a consequence, we allow solutions to leave the 
Nelson-differentiable class. 

4.2.2. Second order differential equations. — Let us consider a second order dif- 
ferential equation 

— +a(x)- + b(x) = 0, (2 -(ODE) 

where x £ R, and a, b : R — > R are given functions. The stochastic embedding of (2 — 
(ODE)) leads to 

V 2 X + a(X)VX + b(X) = 0. 

In this case, contrary to what happens for first order differential equations, we have no 
reality condition which constrains our stochastic process. 



In order to study such kind of equations, one can try to reduce it to a first order 
equation, using standard ideas. We denote by Y = VX, then the second order equation 



54 



CHAPTER 4. STOCHASTIC EMBEDDING OF DIFFERENTIAL OPERATORS 



(4.16) 



is equivalent to the following system of first order stochastic differential equations: 

VX = Y, 

VY = -a(X)Y - b(X). 
One must be careful to take Y € C^(i") as Y is a priori a complex stochastic process. This 
remark is of importance since if we apply the stochastic embedding procedure^ 2 ) to the 
classical system of first order differential equations 



(4.17) 



dx 

-it = y, 



r| = -a(x)y - b(x), 



by saying that we apply separately the stochastic embedding on each differential equa- 
tions, we obtain the stochastic equation (4.16) but with Y € C 1 (/), which imposes strong 
constraints on the solutions of our equations. 

This example proves that the stochastic embedding procedure is not so easy to define 
if one wants to deal with systems of differential equations. We will return on this problem 
concerning the stochastic embedding of Hamiltonian systems. 



^Note that we have not defined the stochastic embedding procedure on systems of differential equations. 
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REVERSIBLE STOCHASTIC EMBEDDING 



5.1. Reversible stochastic derivative 

In our construction of the stochastic derivative, we have imposed some constraints as for 
example the gluing to the classical derivative on differentiable deterministic processes. We 
have moreover kept some properties of the classical derivative such as linearity. However, 
we have not conserved more important properties of the classical derivative which are used 
in the study of classical differential equations. For example, let us consider 

$ = >«• < E » 

which is the basic equation of Newton's mechanics. An important property of this kind 
of equations is its reversibility: 

Let t — ► x(t) be a solution of (E). We denote by x(t) = x(—t). Then, we have 

§ = |(-|(- 4 )) = $H) = /W-'» -/<*«». 

proving that the reversed solution x(t) is again a solution of the same equation. In this 
case, we say that the differential equation is reversible. 

The reversibility argument used the following important property: 

!«-<»--£<-«■ (R) 

The natural way to introduce a notion of reversibility is then to look for the stochastic 
differential equation satisfied by X(t) = X(—t) € C 1 (7) the reversed processes. However, 
in general, we do not have access to DX or D^X. As a consequence, a definition using 
this characterization is not effective. In the following, we follow a different strategy. 
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A convenient way to characterize the reversibility of a given differential equation, de- 
scribed by a differential operator 

(5.1) = Y^Oi^€R[d/dt] 

i 

is to prove that this operator is invariant under the substitution 

(5.2) r : R[d/dt] — ► R[d/dt] 
which is R linear and defined by 

(5.3) r{d/dt) = -d/dt. 

We then introduce in our setting, the following analogous substitution: 

Definition 5.1. — The reversibility operator R : C[D,D*] — > C[Z),-D*] is a C morphism 
defined by 

(5.4) R(D) = —D*, R(D^) = -D. 

We have the following immediate consequence of the definition: 
Lemma 5.1. — The reversibility operator is an involution of C[D,D*\. 

This operator acts non trivially on our stochastic derivative. Precisely, we have: 
Lemma 5.2. — 

(5.5) Rip) = —V. 

The complex nature of the stochastic derivative induces new phenomenon which are 
different from the classical case. For example, we have 

(5.6) R(V 2 )=V 2 , 
contrary to what happens for r. 

We now define our notion of a reversible stochastic equation. 

Definition 5.2. — [Reversibility] Let O G R[D, D*\, then the stochastic equation O X = 
is reversible if and only if R{0) ■ X = 0. 

A natural problem is the following: 



Reversibility problem: Find an operator such that the stochastic embedding of a 
reversible equation is again a reversible equation in the sense of definition 5.2. 



5.1. REVERSIBLE STOCHASTIC DERIVATIVE 
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Let us consider the family of stochastic derivatives T>^, [i = 0, ±1. Without assuming 
a particular form for the underlying equation, the preservation of the reversible character 
reduces to prove that the operator 5 which is chosen satisfies 

(5.7) R(S) = -5. 

In the family of stochastic derivatives V^, \i = 0, ±1, only one case is possible: 

Lemma 5.3. — A reversibility of a differential equation is always preserved under a 
stochastic embedding if and only if this embedding is associated to the stochastic derivative 
V . 

Proof. — Essentially this follows from equation (5.5). If we want to preserve reversibility 
then the operator must satisfied R{T>^) = — V^. This is only possible if is real, i.e. 
H = 0. □ 

It must be pointed out that the operator 

D + 

has been obtained by different authors using the following argument: 

If we use only D (or D*) then, we give a special importance to the future (or past) of 
the process, which has no physical justification. As a consequence, one must construct 
an operator which combines these two quantities in a more or less symmetric way. The 
simplest combination is a linear one aD + bD* with equal coefficients a = b. The gluing 
to the classical derivative leads to a = b = 1/2. 

The problem with this construction is that this argument is used on diffusion processes, 
where D and are not free. As a consequence, working with D is the same (even if 
the connection with is not trivial) than working with D*. We can not really justify 
then the use of Vq. It must be pointed out that E. Nelson [53] does not use Vq in his 
derivation of the Schrodinger equation, but simply D. 

Here, this operator is obtained by specialization of V^, which form is imposed by our 
construction (linearity, gluing to the classical derivative, reconstruction property). The 
reconstruction property imposes that fi ^ unless we work with diffusion processes. 

Imposing a new constraint on the reversibility on this operator leads us to /j, = 0. The 
operator Vq is of course defined on C 1 (I), but in order to satisfy the whole constraints of 
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our construction, we must restrict its domain to diffusion processes. 

We can of course find reversible equations without using T>q but T>^. We keep the 
notations and conventions of chapter 4. We first define the action of R on a given operator 
of the form 

n 

(5.8) = Y j a l (;t)(-iyv\ 

Definition 5.3. — The action of R on (5.8) is denoted R(0) and defined by 

n 

(5.9) R{0) = Y,*i{;t)V\ 

i=0 

The definition 5.2 of a reversible equation can then be extended to cover operators of 
the form 5.8. 

Using this definition, we can prove that the stochastic equation 

VlX = -VU(X), (E) 

is reversible. 

Indeed, we have: 
Lemma 5-4- — Equation (E) is reversible. 
Proof. — We have 

R(VlX + VU(X)) = V 2 X + VU(X), 
= V*X + VU(X). 

As U is real valued and X are real stochastic processes, we deduce from (E) that 



(5.11) V\X = -VU(X) = -VU{X). 
We deduce that 

(5.12) R(VlX + VU{X)) = 0, 

which concludes the proof. □ 



5.2. Iterates 

There exists a fundamental difference between Vq and 2? M , fj, ^ 0. The operator Vq 
send real stochastic processes to real stochastic processes in the contrary of X> M , fi / 0, 
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which leads to complex stochastic processes. As a consequence, the n-ieme iterates of V$ 
is simply defined by 

(5.13) V% = V o...oV , 

without problem, where a special extension of 2? M , \i / to complex stochastic processes 
must be discussed. 

5.3. Reversible stochastic embedding 

Using Dq, we can define a stochastic embedding which conserves the fundamental prop- 
erty of reversibility of a given equation. We keep notations from chapter 4. 

Definition 5-4 (Reversible stochastization). — Let O G A[d/dt] be a differential op- 
erator, of the form 

d d n 
O = a («,t) + ai(«,f)— H \-a n (;t) — , a« G A, =0, ...,n, 

where n G N is given. 

The reversible stochastic embedding of O is an element O rey o/C 1 (I)[Po] defined by 

(5.14) O rev = a (•,*)+ a! {*,t)V + ■ ■ ■ + a n {*,t)V%, ctj G C 1 (-f), i = 0,...,n. 

A differential equation (E) is defined by a differential operator O G „4[d/dt], i.e. an 
equation of the form 

O ■ x = 0, (£) 

where £ is a function. 

Using stochastization, the reversible stochastic analogue of (E) is defined by 

O rev • X = 0, (RSE) 

where X is a stochastic process. 

5.4. Reversible versus general stochastic embedding 

The reversible stochastic embedding leads to very different results than the general 
stochastic embedding. We can already see this difference on first order differential equa- 
tions. Let us consider 

dx 

Tt =f{x) ' 

where x G R and / is a real valued function. The reversible stochastic embedding gives 

V X = f(X). 

Contrary to what happens for the stochastic embedding, this equation does not impose 
for the solution to be a Nelson differentiable processes. 
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5.5. Stochastic mechanics and the Stochastization procedure 

5.5.1. The Stochastic Newton Equation. — The stochastized version of the classical 
system: 

x{t) = v(t) 

(5.15) v(t) = K(x{t)) 
is given by: 

VX{t) = V(t) 

(5.16) VV(t) = K(X{t)) 

where V 6 Sc(-0 and K is a force: K{x) = —VU(x) and U a potential. 

We can give at least two different kind of solutions of this equation, and so two relevant 
models. 

In the first one, the component X is the position in the Ornstein-Uhlenbeck theory of 
Brownian Motion and is not submitted to a random noise. The system writes: 

( dX(t) = V{t)dt 

(5.17) I dV(t) = -aV(t)dt + K(X(t))dt + adW(t) 
{ X(0) = X , V(0) = V , 

We have noticed in a previous section that, at an equilibrium (i.e. X has a stationary 

density) and if e~ u is integrable, then: 

(5.18) VX(t) = V(t), 

(5.19) V 2 X(t) = K(X(t))-iaV(t). 

Therefore (X, V) solves the Newton stochastized system (5.16) if and only if a = 0. 
Moreover we note in this particuliar case that X is a Nelson-differentiable process. 

The second one is described by 

(5.20) dX(t) = b(t, X(t))dt + adW{t), 

where the function b must be determined. In this case, we proved that the density pt(x) 
of a solution X of (5.16) writes pt(x) = ^(t,x)^(t,x) where VP solves the Schrodinger 
equation: ia + ^dxx^ = U^f. In this case, X is driven by a Brownian motion and is 
not Nelson-differentiable. 
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STOCHASTIC EMBEDDING OF 
LAGRANGIAN AND HAMILTONIAN 

SYSTEMS 



CHAPTER 6 



STOCHASTIC LAGRANGIAN SYSTEMS 



Most of classical mechanics can be formulated using Lagrangian formalism ([5], [2]). 
Lagrangian mechanics contains important problems, like the n-body problem. Using our 
framework, we study Lagrangian dynamical systems under stochastic perturbations^ 1 ). 

Our approach is first to embed classical Lagrangian systems, in particular the associated 
Euler-Lagrange equation (EL) in order to obtain an idea of what kind of equation govern 
stochastic Lagrangian systems. We then develop a stochastic calculus of variations. We 
obtain an analogue of the least- action principle^ which gives a second stochastic Euler- 
Lagrange equation, denoted by (SEL) in the sequel. We then prove the following surprising 
result, called the coherence lemma: we have §>(EL) = (SEL). 

The principal interest of Lagrangian systems is that the action of a group of symmetries 
leads to first integrals of motion, i.e. functions which are constants on solutions of the 
equations of motion. The celebrated theorem of E. Noether gives a precise relation between 
symmetries and first integrals. We prove a stochastic analogue of E. Nother theorem. 

Finally, we prove that the stochastic embedding of Newton's Lagrangian systems lead 
to a non linear Schrodinger's equation for a given wave function whose modulus is equal 
to the probability density of the underlying stochastic process. 



6.1. Reminder about Lagrangian systems 

We refer to [5] for more details, as well as [2]. 



( 'For the n-body problem, which is usually used to study the long term behavior of the solar system 
[47], this problem is of crucial importance. Indeed, the n-body problem is only an approximation of the 
real problem, and even if some numerical simulations take into account relativistic effects [40], this is not 
sufficient [50]. 

( 2 -*In our case, the word least-action is misleading and a better terminology is stationary (see below). 
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Lagrangian systems play a central role in dynamical systems and physics, in particular 
for mechanical systems. A Lagrangian system is defined by a Lagrangian function, com- 
monly denoted by L, and depending on three variables: x, v, and t which belongs in the 
sequel to K. As Lagrangian systems come from mechanics, the letter x stands for position, 
the letter v for speed and the letter t for time. In what follows, we consider a special type 
of Lagrangian function called admissible in the following. 

Definition 6.1. — An admissible Lagrangian function is a function L such that: 

i) The function L(x,v,t) is defined on R d x C d x R, holomorphic in the second variable 
and real for v £R. 

ii) L is autonomous, i.e. L does not depend on time. 

Condition i) is fundamental. This condition is necessary in order to apply the stochas- 
tization procedure (see below). The fact that we only consider autonomous Lagrangian 
function is due to technical difficulties in order to take into account backward and forward 
nitrations in the computation of the stochastic Euler-Lagrange equation (see below). 

Remark 6.1. — In applications, admissible Lagrangian functions L are analytic exten- 
sions to the complex domain of real analytic Lagrangian functions. For example, the 
classical Newtonian Lagrangian L(x,v) = (l/2)v 2 — U(x), defined on an open^ subset of 
Rxl, with an analytic potential is an admissible Lagrangian function. 

A Lagrangian function L being given, the equation 



is called the Euler-Lagrange equations. 

An important property of the Euler-Lagrange equation is that it derives from a vari- 
ational principle, namely the least action principle (see [5], p. 59). Precisely, a curve 



on the space of curves passing through the points x(a) = x a and x(b) = Xb, if and only if 
it satisfies the Euler-Lagrange equation along the curve x(t). 

^This Lagrangian function is not always denned on I x 1. An example is given by Newton's potential 
U(x) = l/x, x £ R*. 

( 4 -*We refer to [5], chapter 3, §.12 for an introduction to the calculus of variations. 




(EL) 



7 : 1 1— ► x(t) is an extremal^ of the functional 




6.3. THE COHERENCE PROBLEM 
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6.2. Stochastic Euler-Lagrange equations 

We now apply our stochastic procedure 8 to an admissible Lagrangian. 

Lemma 6.1. — Let L(x,v) : K d x C d — > C be an admissible Lagrangian function. The 
stochastic Euler-Lagrange equation obtained from (EL) by the stochastic procedure is given 
by 

(dL, /N \ dL 



Proof. — The Euler-Lagrange equation associated to L(x,v) can be seen as the following 
differential operator 

Q _ d Q dL dL 
dt dv dx ' 
acting on (x(t),x(t)). The embedding of Oel gives 

^ dL dL 
Oel=V i1 o — -—. 

ov ox 

As Oel acts on (x(t),x(t)), the operator Oel acts on (X(t),V^X(t)). This concludes the 
proof. □ 

The free parameter fx € {— 1, 0, 1} can be fixed depending on the nature of the extension 
used. 



It must be pointed out that there exist crucial differences between all these extensions 
due to the fact that is complex valued for [i = ±1 and real for fi = 0. Indeed, let us 
consider the following admissible Lagrangian function: 

L(x,v) = \ 2 -U(x), 

where U is a smooth real valued function. Then, equation §(EL) gives 

V li V = U(X), 

where V = V^X. When \i = ±1, this equation imposes strong constraints on X due to 
the real nature of U(X), namely that V^X G N^J). 

On the contrary, when fj, = 0, i.e. in the reversible case, these intrinsic conditions 
disappear. 



6.3. The coherence problem 

Up to now, the stochastic embedding procedure can be viewed as a formal manipulation 
of differential equations. Moreover, as most classical manipulations on equations do not 
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commute with the stochastic embedding, this procedure is not canonical ( 5 ). In order 
to rigidify this construction and to make precise the role of this stochastic embedding 
procedure, we study the following problem, called the coherence problem: 

We know that the Euler-Lagrange equations are obtained via a least-action principle 
on a functional. The main problem is the existence of a stochastic analogue of this least- 
action principle, that we can call a stochastic least action principle, compatible with the 
stochastic embedding procedure. 



(6.1) L(x(t),x(t)) — +L(X(t),T>X(t)) 



Least action principle 



Stochastic least action principle ? 



? 



(EL) (SEL) 

In the next chapter, we develop the necessary tools to answer to this problem, i.e. a 
stochastic calculus of variations. Note that due to the fact that the stochastic Lagrangian 
as well as the stochastic Euler-Lagrange equation are fixed, this problem is far from being 
trivial. The main result of the next chapter is the Lagrangian coherence lemma which 
says precisely that the stochastic Euler-Lagrange equation obtained via the stochastic 
embedding procedure coincide with the characterization of extremals for the functional 
associated to the stochastic Lagrangian function using the stochastic calculus of varia- 
tions. As a consequence, we obtain a rigid picture involving the stochastic embedding 
procedure and a first principle via the stochastic least action principle. 



This picture will be then extended in another chapter when dealing with the Hamilto- 
nian part of this theory. 



( 'We return to this problem in our discussion of a stochastic symplectic geometry which can be used to 
bypass this kind of problem. 



CHAPTER 7 



STOCHASTIC CALCULUS OF VARIATIONS 



The embedding procedure allows us to associate a stochastic Euler-Lagrange equation 
to a stochastic Lagrangian function. A basic question is then the existence of an analogue 
of the least action principle. In this section, we develop a stochastic calculus of variations 
for our Lagrangian function following a previous work of K. Yasue [71]. Our main result, 
called the coherence lemma, states that the stochastic Euler-Lagrange equation can be 
obtained as an application of a stochastic least action principle. Moreover, this derivation 
is consistent with the stochastic embedding procedure. 

7.1. Functional and L-adapted process 

In the sequel we denote by / a given open interval (a, b), a < b. 

We first define the stochastic analogue of the classical functional. 

Definition 7.1. — Let L be an admissible Lagrangian function. The functional associ- 
ated to L is defined by 



for allX £ S^J). 

In what follows, we need a special notion introduced by Yasue [71], and called L- 
adaptation: 

Definition 7.2. — Let X £ S 1 (I) be a stochastic process. We denote by V and T 
the past and the future of X. Let L be an admissible Lagrangian function. A process 
X G C 1 (I) is called L-adapted if: 




b 



(7.1) 



i) — — (X(t), V^X(t)) is adapted to V and J 7 . 
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ii) ^-(X(t),v,x(t))ee\i). 

Diffusion processes are L-adapted. 



7.2. Space of variations 

Calculus of variations is concerned with the behaviour of functionals under variations 
of the underlying functional space, i.e. objects of the form 7 + h, where 7 belongs to the 
functional space and h is a given functional space of variations. A special care must be 
taken in our case to define what is the class of variations we are considering. In general, 
this problem is not really pointed out as both variations and curves can be taken in the 
same functional space (see [5],p.56,footnote 26). We introduce the following terminology: 

Definition 7.3. — Let P be a subspace of and X € C 1 (I). A P -variation of X is 

a stochastic process of the form X + Z , where Z G P. 

In the sequel, we consider two subspaces of variations: ^(i) and C 1 (7). 

The choice of C 1 (I) is natural. However, doing this we can obtain stochastic processes 
with completely different behaviour than X^\ 

What is the specific property of X G C 1 (I) that we want to keep ? 

If we refer to the construction of the stochastic derivative, then a main point is the exis- 
tence of an imaginary part in V^X^ 2 \ This property is related to the non-differentiability 
of the underlying stochastic process. We are then lead to search for variations Z which con- 
serve this imaginary part. As a consequence, we must consider Nelson difference processes 
introduced in the previous part^ 3 \ and denoted by N^J). 



7.3. Differentiable functional and stationary processes 

We now define our notion of differentiable functional. Let P be a subspace of C 1 (/). 



^'Of course, this is not the case in the classical case: one consider x 6 C°°{I) and z 6 C°°(I) such that 
x + h 6 C°°(I) is very similar to x. For example, we don't choose z € C°(I) which leads to radically new 
behaviour of x + z with respect to x. 

( 2 '0f course, as long as fi = ±1. This is of importance since we will be able to choose a more general 
variations space in this case. 

( 3 ^An analogous problem is considered in [14], where a non differentiable variational principle is defined. 



7.3. DIFFERENTIABLE FUNCTIONAL AND STATIONARY PROCESSES 
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Definition 7.4- — Let L be an admissible Lagrangian function and J a ^ the associated 
functional. The functional J a ^ is called P -differ entiable at an L-adapted process X G C 1 (I) 
if 

(7.2) J a , b (X + Z)- J a , b (X) = dJ a , b (X, Z) + R(X, Z), 

where dJ a ^(X,Z) is a linear functional of Z G P and R(X,Z) = o(|| Z ||). 

The stochastic analogue of a stationary point is then defined by: 

Definition 7.5. — A P -stationary process for the functional J a ,b is a stochastic process 
X G 6^1) such that dJ(X, Z) = for all Z G P. 

7.3.1. The P = Q l {I) case. — Our main result is: 

Lemma 7.1. — The functional J ab defined by (7.1) is C 1 (I) -differ -entiable at any L- 

adapted process X G C 1 (7), and for all Z G C 1 (7), the differential is given by: 

(7.3) 



dJ ab (X,Z) = E 



dL f dL 

{X{u),V^X{u))-V^ ( —(X(u),V,X(u)) 



dx 



+g(Z,d v L)(b)-g(Z,d v L)(a), 



Z(u)du 



where 
(7.4) 



g(Z,d v L)(s) =E[Z(u)d v L(X(u),V^X(u))] . 



Proof. — Let X and Z be two L-adapted processes. The Taylor expansion of L gives: 
L(X + Z,V^X + Z))-L(X,V^X)) = d x L{X,V„{X))Z 



(7.5) 



which yields (7.6) by integration and (3.9). 



+d v L(X,V^X))V^Z) 
+o(\\Z\\), 



□ 



7.3.2. The P = N^J) case. — Our main result is: 

Lemma 7.2. — The functional J ab defined by (7.1) is N 1 (I) -differentiate at any L- 
adapted process X G C 1 (J), and for all Z G ^(I) the differential is given by: 



(7.6) 

where 
(7.7) 



dJ ab (X,Z) = E 



(d x L - V^d v L){X{u),V^X(u))Z{u)du 



+g(Z,d v L)(b)-g(Z,d v L)(a), 
g(Z,d v L)(s) =E[Z(u)d v L(X(u),V tl X(u))} . 
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Proof. — Let X a L— adapted process and H a Nelson-differentiable process. The Taylor 
expansion of L gives 

L(X + H,V^(X + H)) - L(X,V^(X)) = d x L(X,V^(X))H 

(7.8) +d v L(X,V lx (X))V lx (H) 

+o(\\H\\), 

which yields (7.6) by integration and (3.18). □ 

7.4. A technical lemma 

The classical derivation of the least action principle used a well known result about 
bump functions (see [5], p. 57). In the stochastic framework, we will need the following 
result: 

Lemma 7.3. — Let Y £ Vc be a complex stochastic process. IfY satisfies 

(7.9) I E [Y{u)V^Z(u)] alu = 0, 

Jo 

for all Z £ AA 1 ([0, 1]) then Y is a constant process. 

Proof. — We denote Y = Y 1 + iY 2 , where Y { £ Vr and V^Z = A, where A £ Vr. The 
equation (7.9) is equivalent to 

f*E\Yi(u)A(u)] du = 0, 
1 ' ' J E [Y 2 {u)A{u)} du = 0, 

for all A gVr such that there exists Z £ C^O, 1]) satisfying V^Z = A. 

Let Zy 1 be the process defined by 

(7.11) Z Yl (u)= Y^ds-u Fi(s)ds. 

Jo Jo 

We have Z Yl € fiP-{I) with Z(0) = Z(l) = 0. Indeed, we have 

(7.12) V lx Z(u)=Y 1 (u)- f Y^ds. 

Jo 

As a consequence, we have in our notations B = and the first equation of (7.10) reduces 



to 

(7.13) E[Yi(u)A(u)]du = E (y^u) - Y^ds^j 



du 



We deduce that Y\ is a constant process, that is for all u £ I, Y\(u) = C a.s., where C is 
a random variable. 



The same argument with the second equation of (7.10) and Zy 2 concludes the proof of 
the lemma. □ 



7.5. LEAST ACTION PRINCIPLES 
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As for the computation of the differential of functionals, we must consider two cases: 
P=e 1 (I) andP = N 1 (I). 

7.5.1. The P = C 1 (/) case. — The main result of this section is the following analogue 
of the least-action principle for Lagrangian mechanics. 

Theorem 7.1 (Global Least action principle). — A necessary and sufficient condi- 
tion for an L-adapted process to be a C 1 (I) -stationary process of the functional J a b with 
fixed end points X(a) := X a £ H et X(b) := X\, £ H is that it satisfies 



(7.14) 



dL 

dx 



(X(t),V^X(t))-V^ 



dL 

dv 



{X{t),V,X{t)) 



= 0. 



We call this equation the Global Stochastic Euler- Lagrange equation (GSEL). 

We have conserved the terminology of least-action principle even if we have no notion 
of extremals for our complex valued functional. 

Proof — We denote by I =]0, 1[. Let X £ C^I) be a solution of 

(7.15) {8 X C - V^d v L)(X{u),V^X(u)) = 0, 
then X is a AA 1 (I)-stationary process for the functional J/. 

Conversely, let X is a C 1 (/)-stationary process for the functional Ji, i.e. dJi(X, Z) = 0. 
Writing 

(8 X C - V ll d v L)(X(u),V ll X(u)) = V„Y(u), 

where 

(7.16) Y{u)= [ d x L{X(s),V^X{s))ds-d v L{X{u),V^X(u)), 

Jo 

we obtain for any Z G 6^7) with Z(0) = Z(l) = 0: 



(7.17) 



dJi(X,Z) = E 



f V^Y{u)Z(u)du 
Jo 

E[V^Y{u)Z{u)]du. 



Using the C 1 (/)-product rule (see equation 3.9), we obtain 



(7.18) 



dJ 7 (X, Z) 



E[Y(u)Vf,Z{u)]du. 



Using lemma 7.3 we obtain that Y is a constant process. 
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Hence, we have V^Y(u) = and 
(7.19) (d x £ - V^d v L)(X(u),V^X(u)) = 0, 

which concludes the proof. 

7.5.2. The P = N^i") case. — Our main result is: 



□ 



Theorem 7.2 (least action principle). — A necessary and sufficient condition for an 
L-adapted process to be a N 1 (I) -stationary process of the functional J a b with fixed end 
points X(a) := X a 6 H et X(b) := Xb £ H is that it satisfies 



dL 

(7.20) —(X(t),T>rX(t))-I> (l 



dL 

dv 



(X(t),V^X(t)) 



0. 



We call this equation the weak stochastic Euler- Lagrange equation (SEL). 

Proof. — We denote by / =]0, 1[. Let X £ C 1 ^) be a solution of 

(7.21) (8 X £ - V^d v L)(X(u),V^X(u)) = 0, 
then X is a AA 1 (I)-stationary process for the functional J/. 

Conversely, let X is a AA 1 (/)-stationary process for the functional J/, i.e. d.Jj(X, Z) = 0. 
Writing 

(d x £ - V„d v L)(X{u),V„X{u)) = V^Y(u), 

where 

(7.22) Y(u)= d x L(X(s),V^X(s))ds-d v L(X(u),V^X(u)), 

Jo 

we obtain for any Z G AT 1 (I) with Z(0) = Z(l) = 0: 



(7.23) 



dJi(X,Z) = E 



I V^Y{u)Z{u)du 
Jo 

E[V^Y{u)Z{u)]du. 



Using the AA 1 (/)-product rule (see equation 3.18), we obtain 



(7.24) 



dJi(X, Z) 



E[Y(u)V^Z{u)]du. 



Jo 



Using lemma 7.3, we deduce that Y is a constant process, that is for all u £ I, Y{u) = C 
a.s. where C is a random variable. 



Hence, we obtain V^Y(u) = and 
(7.25) (d x £ - V^d v L)(X(u),V^X(u)) = 0, 
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which concludes the proof. □ 



7.6. The coherence lemma 

It is not clear that the stochastic Euler-lagrange equation obtained by the stochastiza- 
tion procedure and the N^i") or C 1 (I) least-action principle coincide. One easily sees that 
this is not the case for P = C 1 (/). In the contrary, we have the following lemma, called 
the coherence lemma, which ensure that for P = ^(I) we obtain the same equations. 

Lemma 7-4 (coherence lemma). — The following diagram commutes : 

(7.26) L(x(t),x'(t))—^L(X(t),VX(t)) 



Least action principle 



Stochastic Least action principle 



(EL) (SEL) 

Proof. — This is an immediate consequence of the previous results. □ 

Remark 7.1. - - When fi = 0, i.e. in the reversible case, the previous lemmas and the- 
orems are true under C 1 (/) variations. Note that when \i = 0, our stochastic derivatives 
coincides with the Misawa- Yasue [52] canonical formalism for stochastic mechanics. 



CHAPTER 8 



THE STOCHASTIC NOETHER THEOREM 



A natural question arising from the stochastization procedure of classical dynamical 
systems, in particular, Lagrangian systems, is to understand what remains from classical 
first integrals of motion. First integrals play a central role in many problems like the n- 
body problem. In this section, we obtain a stochastic analogue of the Noether theorem. We 
then defined the notion of first integrals for stochastic dynamical systems. We also discuss 
the consequences of the existence of first integrals in the context of chaotic dynamical 
systems. 

8.1. Tangent vector to a stochastic process 

Let X e C 1 (7) be a stochastic process. We define the analogue of a tangent vector to 
X at point t. 

Definition 8.1. — Let X G C 1 (^), I C R. The tangent vector to X at point t is the 
random variable VX(t). 

Remark 8.1. — Of course, in order to define stochastic Lagrangian systems in an in- 
trinsic way, one must define the stochastic analogue of the tangent bundle to a smooth 
manifold. In our case, it is not clear what is the adequate geometric object underlying 
stochastic Lagrangian dynamics. For example, we can think of multidimensional Brow- 
nian surfaces ([23],%. 16.4)- All these questions will be developed in a forthcoming paper 
[17]. 

8.2. Canonical tangent map 

In the sequel, we will need the following mapping called the canonical tangent map: 
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Definition 8.2. — For all X G C 1 (I), we define the canonical tangent map as 

The mapping T will be used in the following section to define the analogue of the linear 
tangent map for a stochastic suspension of a one parameter group of diffeomorphisms. 



8.3. Stochastic suspension of one parameter family of diffeomorphisms 

We begin by introducing a useful notion of stochastic suspension of a diffeomorphism. 

Definition 8.3. — Let <f) : R n — ► R™ be a diffeomorphism. The stochastic suspension of 
<f> is the mapping <3? : V — > V defined by 

(8.2) VX € V, ®{X) t {u) = <f>(X t (u)). 

In what follows, we will frequently use the same notation for the suspension of a given 
diffeomorphism and the diffeomorphism. 

Remark 8.2. — It seems strange that we have not defined directly the notion of diffeo- 
morphism on a subset E of the stochastic processes, i.e. mapping <3? : E — > E which are 
Frechet differentiable with an inverse which is also Frechet differentiable. However, these 
objects do not always exist. 

Using the stochastic suspension, we are able to define the notion of stochastic suspension 
for a one-parameter group of diffeomorphisms. 

Definition 8.4- — A one-parameter group of transformations & s : A — > A, s £ R, where 
A C V , is called a (p- suspension group acting on A if there exist a one parameter group of 
diffeomorphisms <p s : W 1 — > W l , s € R, such that for all s £ R, we have: 

i) $ s is the stochastic suspension of 4> s , 

ii) for allX € A, & 8 (X) € A. 

This notion of suspension group comes from our framework. It relies on the fact 
that we want to understand how symmetries of the underlying Lagrangian systems are 
transported via the stochastic embedding. The non-trivial condition on the stochastic 
suspension of a one-parameter group of diffeomorphisms acting on A comes from condition 
ii). However, imposing some conditions on the underlying one parameter group, we can 
obtain a stochastic one parameter group which acts on the set E of good diffusion processes. 

Precisely, let us introduce the following class of one-parameter groups: 
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Lemma 8.1. — An admissible one parameter group of diffeomorphisms <3? = {</> s } s( =ir is 
a one parameter group of C 2 - diffeomorphisms on R™ such that 

(8.3) (s,x) ^ ^<t>s{x) is C 2 . 

The main property of admissible one parameter groups is the fact that they are well- 
behaved on the set of good diffusions. 

Lemma 8.2. — Let <3? = (0 s ) sg jj be a stochastic suspension of an admissible one param- 
eter group of diffeomorphisms. Then, for all X G E, we have for all t £ I, and all s 6 R: 



i) The mapping s i-> D^ s X(t) £ C 1 (R), (a.s.), 



ii) We have — [V^ S {X))\ = 



ds 



(a.s.). 



This lemma is trivial in the classical case where X is a smooth function and is 
the classical derivative with respect to time. Indeed, it reduces to the Schwarz lemma. 
However, this inequality plays an essential role in the derivation of the classical Noether's 
theorem (see [5], p. 89). 



Proof. — According to (2.4), 



V,MX)(t) = V,X(t) ■ ^X(t) + ifl ^l^l^X(t) (a.s.). 



So 



§-v M x m =v»x(t).§- s d -£x(t) + i » 2 ()sib . 



a(t,X t ) 2 8d 2 x 4> Sx{t) (as) _ 



d d 



ds dx 



d d 



Since (s,x) *— ► <fi s (x) is C , we have — — (f> s (x) = ——4>s(x) by the Schwarz lemma. In 
the same way, 



dx ds 



ds dx 2 s dx ds dx s 



because (s,x) i-> -§^(f>s(x) is C 2 . 
Therefore : 



d d 2 d 2 d 



ds dx 2 



dx 2 ds 



d 

dS' 



Applying (2.4) to — (j> s , we can conclude that : 



d_ 
ds~ 



[V^<j> a {X))\=V» 



dMX) 
ds 



(a.s.). 



□ 



It must be pointed out that every extension of this lemma will lead to a substantial 
improvement of the following stochastic Noether theorem. 
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8.4. Linear tangent map 

Let X G C 1 (I) and f> : W 1 —>■ W l be a diffeomorphism. The image of A under the 
stochastic suspension of (ft, denoted by <&, induces a natural map for tangent vectors de- 
noted by called the linear tangent map, and defined as in classical differential geometry 
by: 

Definition 8.5. — Let $ be a stochastic suspension of a diffeomorphism. The linear 
tangent map associated to and denoted by <!>*, is defined for all X G C 1 (/) by 

(8.4) $*(A) = T($(A)) = ($(A),P($(A))). 

All the quantities are well defined as diffeomorphisms send C 1 (I) on C 1 (I). 



8.5. Invar iance 

We then obtain the following notion of invariance under a one parameter group of 
diffeomorphisms. 

Definition 8.6. — Let <I> = {</> s } se R be a one-parameter group of diffeomorphisms and 
let L be a functional L : C 1 (/) — > 6^(7). The functional L is invariant under the one- 
parameter group of diffeomorphisms <3> if 

L((/)*X) = L(X), for all 
As a consequence, if L is invariant under <£, we have 

L(MX)MMX))) = l(x,vx), 

for all s G R and A" G C 1 ^). 

Remark 8.3. - We note that this notion of invariance under a one parameter group 
of diffeomorphisms does not coincide with the same notion as defined by K. Yasue ([71], 
p. 332, formula (3.1)) which in our notation is given by: 

L((f> s (X),(f> s (VX)) = L(X,VX), for alls em andX G C^J). 

In fact, K. Yasue definition of invariance does not reduce to the classical notion (see for 
example [5], p. 88) for differ entiable deterministic stochastic processes. 

Moreover, Yasue 's definition is not coherent with the invariance notion used in his proof 
of the stochastic Noether's theorem ([71],theorem 4,P-332). See the comment below. 



8.6. THE STOCHASTIC NOETHER'S THEOREM 
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8.6. The stochastic Noether's theorem 



Noether's theorem has already been generalized a great number of times and covers 
sometimes different statements [32]. Here, we follow V.I. Arnold's ([5], p. 88) presentation 
of the Noether theorem for Lagrangian systems. We correct a previous work of K. Yasue 
([71], Theorem 4,p. 332-333). 



Theorem 8.1. 



Let J aj b be afunctional on S 1 ^) given by 
J a>b (X) = e\ f L(X(t),VX(t))dt 

.J a 



with L invariant under the one-parameter group $ = {</> s } se ]]£. 

Let X G C 1 (I) be a C 1 (-/")- stationary point of J a b with fixed end points condition 



Then, we have 

where 
(8.5) 



X(a) = X a , and X(b) = X b . 
d 



dt 



-E 



Lad L &Y 








grad„L Qs 


s=0- 



= 0, 



Y s = <$> S {X). 



Proof. — Let Y(s, t) = <f) s X(t) for s G R and a ^ t < b. 
As L is invariant under $ = {<^> s } sg R, we have 

^L{Y{s,t),V li Y{s,t)) = (a.s.). 

As Y(.,t) and V^Y(.,t) belong to C^R) for all t £ [a,b] by definition 8.4, iii), we obtain 

(8.6) 



QY dV Y 

grad^L • — + grad^L ^ = (a.s.). 



Using (Lemma 8.2,ii), this equation is equivalent to 



(8.7) 



dY 

grad^L • — + grad^LP^ 



dY 
ds 



= (a.s.). 



As X = Y \ s= q is a stationary process for J a ^, we have 
(8.8) grad^L = £>_ M grad„L. 

As a consequence, we deduce that 

/ dY fdY\\ 

[p^gvadM • — + grad,LP M i^—jj 

Taking the absolute expectation, we obtain 



(a.s.). 



s=0 



(8.9) 



E 



dY 



[P^grad^L] • + grad„LP^ ( — 



dY 



ds 



s=0J 



= 0. 
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Using the product rule, we obtain 

d 



dt 



-E 



grad^L 



dY 
ds 



0, 



s=0J 



which concludes the proof. 



□ 



8.7. Stochastic first integrals 

The previous theorem leads us to the introduction of the notion of first integral for 
stochastic Lagrangian systems* 1 ) . 

8.7.1. Reminder about first integrals. — Let X be a C k vector field or R n , k ^ 1 
(k could be oo or w, i.e. analytic). We denote by 4> x (t) the solution of the associated 
differential equation, such that ^(0) = x and by S the set of all these solutions. 

A first integral of X is a real valued function / : R n — > R such that for all <j) x (t) € S, 
we have 

(8.10) f(Mt)) = <<c, 
where c x is a constant. 

We have not imposed any kind of regularity on the function /, so that / can be just 
C . In this case, the existence of a first integral does not impose many constraint on the 
dynamics. 

If / is at least C 1 , then we can characterize first integrals by the following constraint: 

(8.11) X-f = 0. 

8.7.2. Stochastic first integrals. — The previous paragraph leads us to searching for 
an analogue of the classical notion of first integrals as a functional defined on the set of 
solutions of a given stochastic Euler-Lagrange equation^ 2 ) and real valued. Looking for 
the stochastic Noether theorem, we choose the following definition: 

Definition 8.7. — Let L be an admissible Lagrangian system. A functional I : S 1 (I) — ► 
R is a first integral for the Euler-Lagrange equation associated to L if 

(8.12) | [I(X)] = 0, 
for all X satisfying the Euler-Lagrange equation. 



^Of course, one can extend this definition to general stochastic dynamical systems. 
^ 2 ^Of course, this definition will extend to arbitrary stochastic dynamical systems. 
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We can now interpret the stochastic Noether theorem in term of first integrals, i.e. 
the fact that the invariance of the Lagrangian L under of a one parameter group of 
diffeomorphisms <3? = (<p s )seR induces the existence of a first integral for the associated 
Euler-Lagrange equation, defined by 



s=oJ 

8.8. Examples 

8.8.1. Translations. — We follow the first example given by V.I. Arnold ([5], p. 89) for 
Noether theorem. Let L be the Lagrangian defined by 

V 2 

(8.14) L(X, V) = — - U{X), where X G M 3 , 

V = (Vi, V 2 , V 3 ) G C 3 , V 2 := V? + + V 3 2 and U is taken to be invariant under the one 
parameter group of translations: 

(8.15) (f>s{x) = x + sei, 
where {ei, e 2 , 63} is the canonical basis of M 3 . 

Then, by the Stochastic Noether's theorem, the quantity 

(8.16) EpXi] 

is a first integral since dyL = V and d s 4> s (Xi(uj)) = e±. 

8.8.2. Rotations. — We keep the notations of the previous paragraph. We consider 
the Lagrangian of the two-body problem in IR 3 , i.e. 

(8.17) L(X, V) = q(V) - ^- where q(V) = ^, 

where | . | denotes the classical norm on R 3 defined for all X G M 3 , X = (Xi, X 2 , X 3 ) by 

I X \ 2 =x 2 + x 2 + x 2 . 

We already know that the classical Lagrangian L is invariant under rotations when 
l£l 3 and V G M 3 . Here, we must prove that the same is true for the extended object, 
i.e. for L defined over R 3 \ {0} x C 3 . This extension, as long as it is defined, is canonical. 
Indeed, we define q(z) for z G C 3 as 

(8.18) q(z) = ^(z 2 l +z 2 + zl), z=( Zl ,z 2 ,z 3 )£C 3 . 

Note that our problem is not to discuss an analytic extension of the real valued kinetic 
energy but only to look for the same function on C 3 simply replacing real variables by 
complex one. As long as the new object is well defined this procedure is canonical, which 



(8.13) 



I(X) = E 



grad„L 



d^ s X(t) 



ds 
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is not the case if we search for an analytic extension of q over C 3 which reduces to q on 
R 3 . 

Our main result is then that this group of symmetry is preserved under stochastization, 
which is in fact a general phenomenon that will be discuss elsewhere. 

Lemma 8.3. — The lagrangian L defined over R 3 \{0} x C 3 is invariant under rotations 
4>o t k around the e k axis by the angle 6, k = 1,2,3. 

The proof is based on the two following facts: 

— As 4>e,k is a linear map whose matrix coefficients do not depend on t, we have 

(8.19) P M [</> e , k (X)] = fo,k [V^X] , 
where 4>g^ is trivially extended to C 3 . 

- A simple calculation gives 

(8.20) VzGC 3 , q(<j>o,k(z))=q(z). 
We easily deduce the 4>$ <k invariance of L, i.e. that 

(8.21) L(4> e , k X,V(^ k X)) = L(X,VX). 
We now compute: dg(f>e : k(X)\e=o = ejM and 

d v L(X,VX) ■ d e <t>B,k{X)\<h* = (XAVX) k . 

Therefore the expectation of the "complex angular momentum" X A T>X is a conserved 
vector (A is extended in a natural way to complex vectors). 

8.9. About first integrals and chaotic systems 

In this section, we discuss some consequences of the stochastic Noether's theorem in 
the context of chaotic dynamical systems. The study of deterministic chaotic dynamical 
systems is difficult. 

Here again, we return to the classical ra-body problem, n ^ 3. In this case, in particular 
for large n, the dynamics of the system is very complicated and only numerical results give 
a global picture of the phase space. Despite the existence of a chaotic behaviour, there 
exist several well known first integrals of the system. 

These integrals are used as constraints on the dynamics and can give interesting results, 
as for example J. Laskar's [41] approach to the Titus-Bode law for the repartition of the 
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planets in the solar systems and extra-solar systems. 

Using our approach, we can go further by claiming that such kind of integrals continue to 
exist even if one consider a more general class of perturbations including stochasticity. We 
note that this result is fundamental as long as one wants to relate numerical computations 
on the n-body problem with the real dynamical behaviour of the solar systems, and in 
this particular example, the dynamics of the protoplanetary nebulae. 



CHAPTER 9 



NATURAL LAGRANGIAN SYSTEMS AND THE 
SCHRODINGER EQUATION 



In this section, we explore in details the stochastization procedure for natural La- 
grangian systems. In particular, by introducing a suitable analogue of the action 
functional, we prove that the stochastic Euler-Lagrange equation leads to a non-linear 
Schrodinger equation, depending on a free parameter related to a normalization constraint. 
For a suitable choice of this parameter we then obtain the classical linear Schrodinger 
equation. 

9.1. Natural Lagrangian systems 

In ([5], p. 84), V.I. Arnold introduces the following notion of natural Lagrangian systems: 

Definition 9.1. — A Lagrangian system is called natural if the Lagrangian function is 
equal to the difference between kinetic and potential energy: 

L{x,v) =T{v)-U{x). 

As an example, we have the natural Lagrangian function associated to Newtonian me- 
chanics: 

L(x,v) = \ 2 - U(x), 

where U is of class C°°. 

9.2. Schrodinger equations 

9.2.1. Some notations and a reminder of the Nelson wave function. — We recall 
that Ad is the space of "good" diffusion processes. Let A 9 d be the subspace of A^ whose 
elements have a smooth gradient drift. We then set: 

S = {X eA d | V 2 X{t) = -VU{X(t))}. 
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For a diffusion X in with drift b and density function pt(x), we set: 

(9.1) 6 = (R+ x R d ) \{{t,x), | pt(x) = 0}. 

If X e A g d then there exist real valued functions R and S smooth on such that 

/ 2 2 \ 

(9.2) VX(t) = (b- yVlog(p t ) + i^-Vlog(p t )J (X(t)) = (VS + iV22)(X(t)), 



since 6 is a gradient. Obviously: 



a 2 



(9.3) i?(t, x ) = — log(p t (x)). 
In this case, we introduce the function: 

(R + iS)(t,x) 

(9.4) *(t,x)=e # 

(where K is a positive constant) called the wave function. 

The wave function has the same form than that of Nelson one (see [53]). We then set 
A = S - iR. So * = and VA(t,X(t)) = VX{t). For a suitable K, Nelson shows 
that if X satisfies its stochastized Newton equation (which is the real part of ours) then 
^ satisfies a Schrodinger equation. We show, by using our operator V, the same kind of 
result in the next section. 

9.2.2. Schrodinger equations as necessary conditions. — 

Theorem 9.1. — If X £ S n A% then the wave function (9.4) satisfies the following 
non-linear Schrodinger equation on the set Q: 

(9.5, , Kdt * + E!^im^ + ^ =u% 

Proof. — As U is a real valued function, X G S implies 

V 2 X{t) = -VU(X(t)). 
The definition of implies that on 

W 

VA = -iK . 



Since VA(t,X(t)) = (DX)(t), we obtain 

iKV^(t,X(t)) = VU(t,X(t)). 

Therefore, considering the k-th component of the last equation and using lemma 2.4, we 
deduce 

iK U d -f- +vx{t) . - 4 A ^r) = 



9.2. SCHRODINGER EQUATIONS 



87 



— 

Now VX(t) = —iK—^-(t,X(t)). Thus, by Schwarz lemma, we obtain 



3=1 3=1 V 7 



and 

3=1 3=1 V 7 3=1 V 7 



Therefore 
iKd k 



By adding an appropriate function of t in S, we can arrange the constant in x of integration 
in equation to be zero, and formula (9.5) follows as claimed. □ 

In order to recover the classical linear Schrodinger equation, we must choose the nor- 
malization constant K. The main point is that in this case, we obtain a clear relation 
between the modulus of the wave function and the density of the underlying diffusion 
process. Precisely, we have: 

Corollary 9.1. - - We keep the notations and assumptions of theorem (9.2.3). We as- 
sume that 

K = a 2 . 

Then the wave functional ^ satisfies the linear Schrodinger equation 

4 

(9.6) ia 2 d t y + yAf = UV, 

Moreover, if p t (x) is the density of the process X(t) at point x, then we have 

(VW)(t,x) =Pt(x). 

Proof. — K = a 2 kills the non-linearity in equation (9.5) and furthermore 

log(tftt) = ^R = ^R = log(p). 
which concludes the proof. □ 

9.2.3. Remarks and questions. — 

- Obviously Ai C Af since b is continuous. 

- A natural question is to know if the converse of the corollary of () is true. More 
precisely, if satisfies a linear Schrodinger equation, can we construct a process X 
which belongs to S n A^ and whose density is such that pt(x) = \^(t, x)\ 2 ? 
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R. Carmona tackled the problematic of the so-called Nelson processes and proved 
in [11] under some conditions the existence of a process X with gradient drift related 
to ^ and whose density is such that pt(x) = \^f(t,x)\ 2 . However we do not know 
if this process belongs to our space of good diffusions processes (which may turn 
to be a little restrictive class in this case), but we can prove formally, i.e. even so 
assuming that the formulae of the stochastized derivative to a function of the process 
holds, that X satisfies the Newton stochastized equation. Therefore, this leads one 
to question the extension of the derivative operator and the way it acts on a large 
class of processes. This problem will be treated in a forthcoming paper (See [18]). 

The fact that a process X satisfies the stochastized Newton equation of Nelson implies 
(D 2 — D 2 )X = (for the potential U is real). This is a general fact for diffusion with 
gradient drift. Indeed, we can prove: 



Lemma 9.1. — Let X G A^, b its drift and p its density function. Let Gi be the 
i-th column of the matrix (Gij) := (djbi — dibj). Then (D 2 — D 2 )X = if and only 
if for all t > 0, div(p t Gj) = 0. 

Thus, if X G it is clear that (D 2 - Dl)X = since the form ^ bkdk is closed 
and so G = 0. An interesting question is then to know if the converse is true. So we 
may wonder ourselves if S C A^. 

The difficulty relies on the fact that p and b are related via the Fokker-Planck 
equation, so the condition div(ptGi) = may not be the good formulation. However, 
one could use the work of S. Roelly and M. Thieullen in [61] who use an integration 
by parts via Malliavin Calculus to characterize gradient diffusion, in order to give a 
positive or negative answer to our question. 

A basic notion in mechanics is that of action (see [5], p. 60). The action associated 
to a Lagrangian system is in general obtained via the action functional. In our 
framework, a natural definition for such an action functional is given by: 

Definition 9.2. — Let A be the functional defined on [a,b] x S 1 ([a, 6]) by: 



Vt G [a, b], VX G e 1 ^), A(t,X) = E 
This functional is called the action functional. 



I L(X s ,(VX) s )ds\X t 

J a 



9.3. ABOUT QUANTUM MECHANICS 



89 



Using this action functional, we have some freedom to define the corresponding 
"action" . The natural one is defined by 

(9.8) A x (t,x) = E I L(X s ,(VX) s )ds\X t = x . 

.J a 

Usually, the wave function associated to Ax an denoted by ip is then defined as 

(9.9) i>x(t,x) = exp^C'*) . 

However, it is not at all clear that such kind of function satisfies the gradient condi- 
tion, i.e. that 

(9.10) VA{t,X{t)) =VX{t), 

which is fundamental in our derivation of the Schrodinger equation. 

However, the condition 9.10 is equivalent to prove that the real part of VX is a 
gradient, which is not at all trivial in dimension greater than two. 

9.3. About quantum mechanics 

Even if we look for dynamical systems, our work can be used in the context of the 
so-called Stochastic mechanics, developed by Nelson [53]. The basic idea is to reexpress 
quantum mechanics in terms of random trajectories. We refer to [12] for a review. 

The stochastic embedding theory can be seen as a quantization procedure, i.e. a formal 
way to go from classical to quantum mechanics. This approach is already different from 
Nelson's approach, which do not define a rigid procedure to associate to a given equation 
a stochastic analogue. Moreover, the acceleration defined by Nelson as 

(ail) „ m = ^.(*WTO 

is only a particular choice. Many authors have tried to justify this form ([59], [60]) or to 
try another one. In our context, the form of the acceleration is fixed and corresponds, as 
in the usual case, to the second (stochastic) derivative of X. As a consequence, stochastic 
embeddings can be used to provide a conceptual framework to stochastic mechanics. 
We refer to [59] where a complex valued velocity for a stochastic process is introduced 
corresponding to the stochastic derivative of X. 



However, stochastic mechanics as well as its variants have many drawbacks with respect 
to the initial wish to describe quantum mechanical behaviours. We refer to [55] and [12] 
for details. This is the reason why we will not develop further this topic. 
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STOCHASTIC HAMILTONIAN SYSTEMS 



In this part, we introduce the stochastic pendant of Hamiltonian systems for classical 
Lagrangian systems. The strategy is first to define the stochastic analogue of the classical 
momentum. We then define a stochastic Hamiltonian. However, this Hamiltonian is not 
obtained by the classical stochastic embedding procedure. This is due to the fact that the 
momentum process is complex valued. As a consequence, we must modify the procedure in 
order to obtain a coherent picture between the classical formalism and the stochastic one. 
This leads us to define the stochastic Hamiltonian embedding procedure which reflects in 
fact the non trivial character of the underlying stochastic symplectic geometry to develop. 
Having the stochastic Hamiltonian we prove a Hamilton least action principle using our 
stochastic calculus of variations. We then obtain an analogue of the Lagrangian coherence 
lemma in this case up to the fact that the underlying stochastic embedding procedure is 
now the Hamiltonian one. 



10.1. Reminder about Hamiltonian systems 

We denote by / an open interval (a, b), a < b. 

Let L : R d x M. d x R — > R be a convex Lagrangian. The Lagrangian functional over 
C\R) is defined by 

CHR) — ► C 1 ' 



^ ' x i — ► L(x,x,t). 
We can associate to L a Hamiltonian function using the Legendre transformation 
([5], p. 65). From the functional side, this induces a change of point of view, as the func- 
tional is not seen as acting on x(t), which is the so-called configuration space of classical 
mechanics, but on (x(t),x(t)) which is associated to the phase-space. This dichotomy 
between position and velocities has of course many consequences, one of them being that 
the system is more symmetric (the symplectic structure). 
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Definition 10.1. — Let L(x,v) be an admissible Lagrangian system. For all x £ C 1 , we 
denote by 

dL 

(10.2) p(x) = -^(x,x), 
the momentum variable. 

We now introduce an important class of Lagrangian systems. 

Definition 10.2. — Let L(x,v) be an admissible lagrangian system. The Lagrangian L 
is said to possess the Legendre property if there exists a function f : M. d — ► M. d , called the 
Legendre transform, such that 

(10.3) x = f(x,p), 
for all x G C 1 . 

Most classical examples in mechanics possess the Legendre property. This follows from 
the convexity of L in the second variable (see [5], p. 61-62). 

We can introduce the fundamental object of this section: 

Definition 10.3. — Let L be an admissible Lagrangian system which possesses the Leg- 
endre property. The Hamiltonian function associated to L is defined by 

(10.4) H(p,x)=pf(x,p)-L(x,f(x,p)), 
where f is the Legendre transform. 

The Hamiltonian function plays a fundamental role in classical mechanics. We introduce 
the stochastic analogue in the next section. 

10.2. The momentum process 

A natural stochastic analogue of the momentum variable is defined as follow: 

Definition 10. 4- — Let L(x,v) be an admissible Lagrangian system. For all X £ S 1 (I), 
we define the stochastic process P(t), called the canonical momentum process, by 

(10.5) P(t) = —(X(t),VX(t)). 

This definition can be made more natural using the embedding i defined from C (I) on 
Vdct and the linear tangent map introduced in chapter 8. Indeed, the momentum process 
can be viewed as a functional on X 6 C 1 (I), P : S 1 (I) — > Vc defined by (10.5). We have 

i0T & \\X£V\ ct = L(C\l)), 

(10.6) P{X) = l( P (x)), 
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where x G C 1 (I) is such that X = l{x). As by definition, we have 

(10.7) i{p{x))=p{i{x))=p(X). 

As p keeps a sense for X G S 1 (I), we extend formula (10.7) to C 1 (I) leading to definition 
10.4. 

If we assume that the Lagrangian possesses the Legendre property, then there exists a 
Legendre transform / such that for all x G C 1 , x = f(x,p). We can ask if such a property 
is conserved for the momentum process. We have: 

Lemma 10.1. — Let L(x,v) be an admissible Lagrangian system possessing the Legendre 
property. Let f be the Legendre transform associated to L. We have 

(10.8) VX(t) = f(X,P), 
for allX G &{T). 

We can now define the stochastic Hamiltonian associated to L: 

Definition 10.5. — Let L{x,v) be an admissible Lagrangian system possessing the Leg- 
endre property. The stochastic Hamiltonian system associated to L is defined by 

[ ' (P,X) Pf(X,P)-L(X,f(X,P)). 



10.3. The Hamiltonian stochastic embedding 

As in the previous chapter, we want to use the stochastic embedding procedure to 
associate a natural stochastic analogue of the Hamiltonian equations. However, we must 
be careful with such a procedure, as already discussed in chapter 4, §.4.2.2. Indeed, the 
embedding procedure does not allow us to fix the notion of embedding for systems of 
differential equations. Moreover, we must keep in mind that the principal idea behind the 
Hamiltonian formalism is to work not in the configuration space, i.e. the space of positions, 
but in the phase space, i.e. the space of positions and momenta. As the stochastic speed 
is by definition complex, this induces a particular choice for the embedding procedure in 
the case of Hamiltonian differential equations. 

Definition 10.6. — Let F : M. d x C d i— > C be a holomorphic function, real valued on 
real arguments. This function defines a real valued functional over C 1 (I) x C 1 (I), for I a 
given open interval ofR. The Hamiltonian embedding of the functional F is the functional 
denoted by Fg, defined on S 1 (I) x Vc(I) by H, i.e. 

(10.10) F s (X,P)(t)=F(X(t),P(t)). 
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We denote by Sh the procedure associating the stochastic functional F$ to F. This 
procedure reduces to change the functional spaces for F from x C l {I) to C 1 (I) x Vc- 

The main property of the Hamiltonian stochastic embedding procedure (and in fact 
it can be used as a definition) is to lead to a coherent definition with respect to the 
momentum process. Precisely, we have: 

Lemma 10.2 (Legendre coherence lemma). — Let L(x,v) be an admissible La- 
grangian system possessing the Legendre property. The following diagram commutes 

(10.11) ( XjP ) H(x,p) 



s 



(X,P)-^H(X,P) 

The proof follows essentially from the fact that the stochastic Hamiltonian embedding 
of the functional H, denoted by Hs coincide with the definition 10.5 of the stochastic 
Hamiltonian system associated to H via the Legendre transform and the definition of the 
momentum process. 



10.4. The Hamiltonian least action principle 

Using the stochastic Hamiltonian function, we can use the stochastic calculus of vari- 
ations in order to obtain the set of equations which characterize the stationary processes 
of the following functional: 

fb 

(10.12) I a>b (X,P)=E' ' 



f (P(t)VX - H(X(t),P(t)))dt 

J a 



defined on 6^7) x V c . 



In order to apply our stochastic calculus of variations, we restrict our attention to I on 
C 1 (7) x C 1 (I). The fundamental result of this section is the following: 

Theorem 10.1. — A necessary and sufficient condition for an L- adapted process (X,P) 
to be M 1 (I) -stationary process of the functional with fixed end points (X(a), P(a)) = 
(X a ,P a ) G H, (X(b), P(b)) = (Xb,Pb) G H is that it satisfies the stochastic Hamiltonian 
equations 

VX = ^(X(t),P(t)), 
(10.13) dP 

VP = - — (X(t),P(t)). 
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Proof. — We must use the weak least action principle using the process Z = (X, P) £ 
x Vc and the Lagrangian denoted by C defined on R d x C d x C d x C d by 

(10.14) C(x,p,v,w)=pv — H(x,p). 

As C(x,p,v,w) = L(x,v) formally via the Legendre transform, and L is assumed to be 
admissible, we deduce that C is again admissible. 



Let 5Z be a .A/ 1 (J) variation of the form Z + SZ = (X + X x , P + Pi), where Xi and Pi 
are AA 1 processes. 

The Euler-Lagrange equation associated to C is given by 



(10.15) 



^(Z(t),VZ(t))-V, 
dC 



dp 



(Z(t),VZ(t))-V^ 



dC 

dw 



(Z(t),VZ(t)) 



o, 

0. 



An easy computation leads to 

dH 



(10.16) 



This concludes the proof. 



ijx (z(t),vz{t))-v^p{t) = 0, 

VX(t)-^(Z(t),VZ(t)) = 0. 



□ 



Remark 10.1. — In this proof we do not need a uniform assumption on the set of vari- 
ations as the Lagrangian does not depend on the variable w. In fact, we can assume a 
variation in the direction P which belongs to C 1 (I). 



10.5. The Hamiltonian coherence lemma 

In this section, we derive the Hamiltonian analogue of the Lagrangian coherence lemma. 

Lemma 10.3 (The Hamiltonian cohrence lemma). — Let H : R d x R d — > M be an 

admissible Hamiltonian system. Then, the following diagram commutes 

(10.17) H(x(t),p(t))-^H(X(t),P(t)) 



Least action principle 



Stochastic least action principle 



(HE) i — ~ (SHE) 

The main point is that this result is not valid if one replaces the Hamiltonian stochastic 
embedding by the natural stochastic embedding that we have used up to now. We can 
keep the classical embedding procedure only when dealing with real valued versions of 
the stochastic derivative. For example, if one deals with the reversible stochastic embed- 
ding procedure, we obtain a unified stochastic embedding procedure for both Lagrangian 
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an Hamiltonian systems. We think however that as well as the complex nature of the 
stochastic derivative has a fundamental influence on the form of the stochastic Lagrangian 
equations, i.e. that we obtain the Nelson acceleration, the fact to move from S to Sh 
reflects a basic properties of the underlying stochastic symplectic geometry we must take 
into account this complex character of the speed. This problem will be studied in another 
paper. 



CHAPTER 11 
CONCLUSION AND PERSPECTIVES 



This part aims at discussing possible developments and applications of the stochastic 
embedding procedure. 

11.1. Mathematical developments 

11.1.1. Stochastic symplectic geometry. — The Hamiltonian formalism developed 
in the last part suggest the introduction of what can be called a stochastic symplectic 
geometry. An interesting construction of symplectic structures on Hilbert spaces is given 
in [34]. 

The main point here is to construct an analogue of the geometrical structure which 
puts in evidence the very particular symmetries of the Lagrangian equations in classical 
mechanics. There exists already many attempt to construct a given notion of symplectic 
geometry or at least a given geometry for stochastic processes, but they are as far as 
we know of a different nature. We refer to the book of Elworthy, LeJan and Li [44] for 
an overview. These geometries are only associated to stochastic processes and translate 
into data of geometrical nature properties of the underlying stochastic processes (like the 
Riemannian or sub-Riemannian structure associated to Brownian motions and diffusions) . 

A recent work of J-C. Zambrini and P. Lescot ([37] and [38]) deals specifically with 
symplectic geometry and a notion of integrability by quadratures. 

For a discussion of integrability in our context see section 11.1.2. 

11.1.2. PDE's and the stochastic embedding. — The stochastic embedding of La- 
grangian systems over diffusion processes lead to a PDE governing the density of the 
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solutions of the stochastic Euler-Lagrange equation. Moreover, we have defined a stochas- 
tic Hamiltonian system naturally associated to the Lagrangian. However, some classical 
PDEs, as for example the Schrodinger equation, possess an Hamiltonian formulation. This 
remark, which goes back to the work of Zakharov V.E. and Faddeev D. [72] is now an 
important subject in PDEs known as Hamiltonian PDEs (see for example [34]). As a 
consequence, we have the following situation: 



Of course the relation between the PDE and H$ is not of the same nature as the relation 
with H. 

In the sequel, we list a number of problems and questions which naturally arise from 
the previous diagram: 

- There exists a notion of completely integrable Hamiltonian PDE (see [34]). What 
about out stochastic Hamiltonian systems ? 

Assuming that we have a good notion of integrability for Hs, we have the following 
questions: 

- Are there any relations between the integrability of H and i?s? 

- Is there a stochastic analogue of the Arnold-Liouville theorem? 

- Is there a special set of "coordinates" similar to the action/angle variables? 

We note that there already exists such a notion for Hamiltonian PDEs (see [72]). 

- Is there a notion of integrability by "quadratures" ? 

In that respect, we think about Lax work [36] on the integrability of PDEs. 
11.2. Applications 

11.2.1. Long term behaviour of chaotic Lagrangian systems. — The dynamical 
behaviour of unstable or chaotic dynamical systems is far from being understood, unless 
we restrict to a very particular class of systems like hyperbolic systems or weak version 
of hyperbolicity. This question arises naturally for small perturbations of Hamiltonian 
systems for which there exists a large family of results dealing with this problem, as for 



(11.1) 
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example the KAM (Kolmogorov-Arnold-Moser) theorem, Nekhoroshev theorem and spe- 
cial phenomena like the Arnold diffusion related to the so-called quasi-ergodic hypothesis. 

Unfortunately, these results are difficult to use in concrete situations and only direct 
numerical simulations provide some understanding of the dynamics [22] . 

There exists of course ergodic theory which tries to look for weaker information on 
the dynamics than a direct qualitative approach. However, this theory leads also to very 
difficult problems when one tries to implement it, as for example in the case of Sinai 
billiard. Moreover, there is a widely opinion in the applied community that the long term 
behaviour of a chaotic systems is more or less equivalent to a stochastic process. One 
example of such opinion is well expressed in the article of J. Laskar [41] in the context 
of the chaotic behaviour of the Solar system: "Since the characteristic time scale for 
the divergence of nearby orbits in the Solar system is approximately 5 Myr, the orbital 
evolution of the planet becomes practically unpredictable after 100 Myr. Thus in the 
long term, the motion of the Solar system may be described by a random process, where 
orbits wander erratically in a chaotic zone." 

What are the arguments leading to this idea ? 

The first point is that chaotic dynamical systems are in general characterized by the so- 
called sensitivity to initial conditions, meaning that a small error on the initial condition 
leads to very different solutions. Of course, one must quantify this kind of sentence, and 
we can do that, with more or less canonicity, by introducing Lyapounov exponents and 
Lyapounov time. Whatever we do, there is a non canonical data in this, which is precisely 
to what extent we consider that two solutions are different. This must be a matter of 
choice for a given system, and cannot be fixed by any mathematical tool. In the sequel, 
we assume that a system is sensitive to initial conditions in some region R of the phase 
space, and for a given metric, if for all xo E R and all e > 0, the distance at time t between 
a trajectory starting at xo and xo + e, denoted by d(t) is W approximately given by 

(11.2) d(t) = ee t/T , 



As we already stress, we can in some situations gives a precise meaning to all this point, like for example 
in the Smale Horseshoes, but this is far to cover the wide variety of chaotic behaviour which are studied 
in the applied literature. 



100 



CHAPTER 11. CONCLUSION AND PERSPECTIVES 



where T > is the so-called Lyapounov time or horizon of predictability for the system^ 2 ' . 
For an example of such an estimate, we refer to J. Laskar [42] where he gives numerical 
evidences for the chaotic behaviour of the solar system. 

As a consequence, for t sufficiently large with respect to T, we have no prediction any 
more, or in other words, we can not assign to a given prediction a precise initial condition. 
We then have lost the deterministic character of the equations of motions. An idea is 
then to say that one musts then consider not a fixed initial condition xq, but a given 
random variable representing all the possible behaviours (kind of trajectories) one is lead 
to after a fixed time t: for example, e > being fixed, we consider all the intersections of 
trajectories starting in the disk D(xq, e) with the ball B(xq, e). We then obtain a family of 
directions. Assuming that we can compute an average over the family of such a quantity 
which obtain an averaged direction which select a given point of the ball B(xo,e). We 
then follow the selected trajectory during the time t, and continue again this procedure. 
Such a construction is reminiscent of the classical construction of the Brownian motion 
(see [30], p. 66). Of course, this programme can only be carried in some specific examples. 
We refer to the article of Y. Sinai [62] for an heuristic introduction to all these problems. 

If we agree with the previous heuristic idea, one can then ask for the following: how is 
the underlying stochastic process governed by the dynamical system ? 

We return again to the Hamiltonian/Lagrangian case. The stochastic embedding 
procedure answers precisely this question. The stochastic Euler-Lagrange equation is the 
track of the underlying Lagrangian system on stochastic processes. As a consequence, 
we can think that we are able to capture even the desired long term behaviour of the 
Lagrangian system using this procedure. 

In order to support our point of view, we suggest the following strategy: 

Consider a perturbation of a completely integrable Hamiltonian system H e (x) = h(x) + 
ef(x), with x € M. 2n for example. Let us assume that h(x) leads to a particular PDE 
under stochastic embedding, which can be well understood and solved. The long term 
behaviour of the completely integrable Hamiltonian system is trivial. This not the case 
for the stochastic analogue. What about the long term behaviour of H e ? We think that 



( 'In concrete systems, one must involve a macroscopic scale (see [21], p. 17), which bound the admissible 
size of an error on a prediction. Here, this quantity is arbitrary replaced by e. 
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it is controlled by the stochastic analogue of the unperturbed Hamiltonian. This result is 
related to a kind of stochastic stability which we must define. However, this approach can 
be tested on a wide variety of examples, in particular celestial mechanical problems. 

11.2.2. Celestial mechanics. — There exist many theories dealing with the problem 
of the formation of gravitational structures. For planetary systems this question is related 
to a long standing problem related to the "regular" spacing of planets in the Solar system. 
This problem which goes back to Kepler (1595), Kant (1755), von Wolf (1726), Lambert 
(1761), takes a mathematical form under the Titius (1766) formulation of the so called 
Titius-Bode law giving a geometric progression of the distance of the planets from the 
sun. We refer to the book of Nieto [56] for more details. Even if this empirical law fails to 
predict correctly the real distance for the Planet Pluto for example, its interest is that it 
suggests that the repartition of exoplanet orbital semi-major axes could satisfy a simple 
law. As a consequence, one searchs for a possible physical/dynamical theory supporting 
the existence of such kind of law. Moreover, the discovery of many exo-planetary systems 
can be used to test if the theory is based on universal phenomena and not related to our 
knowledge of the Solar system. 

All the actual theories about the origin of the solar system presuppose the formation 
of a protoplanetary nebula, formed by some material (gas, dust, etc ...) with a central 
body (a star or a big planet). We refer to Lissauer [43] for more details. 

Instead, we use a simplified model consisting of a large central body of mass mo with a 
large number of small bodies (m,j)j = i n , whose mass is assumed to be small with respect 
to mo- The main problem is to understand the long term dynamics of this model. 

Following the work of Albeverio S., Blanchard Ph. and R. Hoegh-Krohn ([3], see also 
[4]), we can modelize the motion of a given grain in the protoplanetary nebula by a 
stochastic process (see [3], p. 366-367), more precisely a diffusion process. The problem is 
then to find what is the equation governing the dynamics of such a stochastic process. 
Using our stochastic embedding theory, we can use the classical formulation in order to 
obtain the desired equation. This question will be detailed in a forthcoming article. 

The main idea behind stochastic modelisation is the following: 

The motion of a given small body in a protoplanetary nebula is given by the Kepler 
model and a perturbation due to the large number of number of small bodies. In [3], 
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this perturbation is replaced by a white noise. As a consequence, the movement of a 
small body is assumed to be described by a diffusion process. It must be noted that this 
assumption is related to a number of arguments, one of them being that the dynamics of 
the underlying classical system is unstable. We then return to our previous description of 
the chaotic behaviour of a dynamical system. However, using the stochastic embedding 
theory, we can try to justify the passage from a classical motion to a stochastic one 
looking at the following problem: 

Let L e = Lxepier + Pe, be the Lagrangian system describing the dynamics of our model. 
The Lagrangian Lxcpicr is the classical Lagrangian of the Kepler problem, and P e is the 
perturbation. Using the stochastic embedding theory, we can deduce two stochastic dy- 
namical systems, one associated to L e and denoted by S e and one associated to ^Kepler 
denoted by Skepier- H the previous strategy to replace the perturbative effect by a White 
noise is valid, then we must have a kind of stochastic stability between Sxepier an d S e . 
The notion of stochastic stability must be defined rigorously and be consistent with the 
stochastic embedding theory®. Why such a stability result is reasonable ? The main 
thing is that we already look in S^cpier for statistical properties of the set of trajectories 
of stochastic (diffusion) processes under the Kepler Lagrangian. There is no reason that 
the statistic of this trajectories really differs when adding a small perturbation. This is 
of course different if one look for the underlying deterministic system. All these questions 
will be studied in a forthcoming paper. 

11.2.3. Strange attractors. — Strange attractors play a fundamental role in turbu- 
lence and lead to many difficult problems. Most of the time, one is currently interested 
in the geometrical properties of attractors (Hausdorf dimension,...), special dynamical 
properties (existence of an SRB (Sibai-Ruelle-Bowen) measure [68], stability under 
perturbations....). However, focusing on a given attractor hides the fact that most of the 
time we can not predict from the equation the existence of such an attractor. This is 
in particular the case for the Lorenz attractor or the Henon attractor. These attractors 
are obtained numerically. In some models, we can construct a geometric model from 
which we can prove the existence of such a structure (this is the case for the geometric 
Lorenz model) [27]. For example, S. Smale [63] asks for an existence proof for the 
Lorenz equation of the attractor. This has been done recently by W. Tucker ([66], [67]). 
However, no general strategy exists in order to predict such an attractor. 



1 'It must be noted that there exists already several notion of stochastic stability in the literature, as for 
example Has'inskii [29], Kushner [35] and more recently Handel [28]. 
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Our idea is to use the stochastic embedding theory in order to predict the existence 
of such an object. Let us consider the Lorenz equations. These equations are not a 
Lagrangian system. However, there exits a canonical embedding in a Lagrangian system 
(see the report of M. Audin [7]). This lagrangian can then be studied via the stochastic 
embedding procedure. The solutions are stochastic processes whose density is controlled 
by a PDE. As we already explain, we expect that the long term behaviour of the system 
is coded by this PDE. As the long term dynamics of the Lorenz system if precisely 
supported by the Lorenz attractor, we think that this structure can be detected in the 
PDE (as a stationary state for example). 

We can also take this problem as a first step towards understanding the existence 
of coherent structures in chaotic dynamical systems. Moreover, the Lorenz attractor is 
widely studied and there exists a great amount of results like the existence of a unique 
SRB measure (see [67]). We can then take this example as a good system to compare 
classical methods of ergodic theory and our approach. For more problems related to the 
Lorenz attractor, SRB measure . . . , see ([69], [70]). 



NOTATIONS 



d: dimension 

(Q,A, P) a probability space 

- Stochastic processes 

- We denote by 

dX = b(t, X)dt + a(t, X)dW, (*) 

the stochastic differential equation where b is the drift, a the diffusion matrix and 
W is a c?-dimensional Wiener process defined on (£l,A, P). 

- We denote by X(t) the solution of (*) and by pt(x) its density (when it exists) at 
point x. 

- a(X s ,a ^ s ^ b): the a-algebra generated by X between a and b 

- T%: an increasing a algebra 

- Vt- an decreasing a algebra 

- E [• | B]: the conditional expectation. 

- || . || : norm on stochastic processes. 

- Functional spaces 

- Vr: real valued stochastic processes 

- Vc: complex valued stochastic processes 

_ Pdct'- the set of deterministic stochastic processes 

- Pfet- the set of deterministic stochastic processes such that X(u) is of class C k 

- A^: good diffusion processes 

- A^: good diffusion processes with a gradient drift 
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— L P (S7): set of random variables which belongs to L p 

— L 2 : the set of real valued processes which are Vt and Tt adapted and such that 



E / X?dt 



< oo. 



C 1,2 ((0, 1) x M. d ) the set of function which are C 1 in the first variable and C 2 in the 
second one. 

A/" 1 : the set of Nelson differentiable processes. 



- Operators 



— V: the gradient 

— A: the Laplacian 

— Let f(xi, . . . , x n ) be a given function. We denote by d x J the partial derivative of / 
with respect to Xi 

— Let f(xi, . . . , x n , 2/1, ... , y m ) be a given function. We denote by d x f, x = (x±, . . . , x n ) 
the partial differential of / in the direction x. 

— D: Nelson forward derivative 

— D*: Nelson backward derivative 

— V: the stochastic derivative 

— D n , D", V n : the n-th iterate of D, or V 

— d and d*: adapted forward and backward derivative 

k ^ 1 

— C k : the set of real valued processes which are Vt and Tt adapted and such that T> % 
exists, 1 ^ i ^ k. 

— C^.: the set of complex valued processes which are Vt and Tt adapted and such that 
V % exists, 1 ^ i ^ k. 



- Re(z): real part of z G C. 

— lm(z): imaginary part of z G C. 
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